SEQUENCE LISTING 

GENERAL INFORMATION: 
(i) APPLICANT: P. D. CHRISTIAN, K. H. J. GORDON and 
T. N. HANZLIK 



(ii) TITLE OF INVENTION: INSECT VIRUSES AND THEIR USES 
PROTECTING PLANTS 



(iii) NUMBER OF SEQUENCES: 53 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DORSEY & WHITNEY LLP 

(B) STREET: FOUR EMBARCADERO CENTER, SUITE 34 00 

(C) CITY: SAN FRANCISCO 

(D) STATE: CALIFORNIA 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) ZIP: 94111 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/677,653 

(B) FILING DATE: 3 OCTOBER 2000 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: RICHARD F. TRECARTIN 

(B) REGISTRATION NUMBER: 31,801 

(C) REFERENCE /DOCKET NUMBER: A-58631-4/RFT/NBC 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (415) 781-1989 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GGATCCACAG NNN 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

ATGGGCGATG CCGGCGTCGC GTTCACAG 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

ATGGAGGATG CTGGAGTGGC GTCACAG 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
'(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

AT GAG C GAG G CCGGCGTCGC GTCACAG 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
CCATCGATGC CGGACTGGTA TCCCAGGGGG 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

CCATCGATGC CGGACTGGTA TCCCGAGGGA C 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



CCATCGATGA TCCAGCCTCC TCGCGGCGCC GGATGGGCA 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GCTCTAGATC CATTCGCCAT CCGAAGATGC CCATCCGGC 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CCATCGATTT ATGCCGAGAA GGTAACCAGA GAAACACAC 

(2) INFORMATION FOR SEQ ID .NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GCTCTAGACC AGGTAATATA CCACAACGTG TGTTTCTCT 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 base pairs 

(B) TYPE: nucleic acid 
-(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGGGGGAATT CATTTAGGTG AC AC T AT AG T TCTGCCTCCC CGGAC 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



GGGGGGATCC TGGTATCCCA GGGGGGC 



(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CCGGAAGCTT GTTTTTCTTT CTTTACCA 

(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GGGGGATCCG ATGGTATCCC GAGGGACGCT CAGCAGGTGG CATAGG 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) - SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
AAATAATTTT GTTACTTTAG AAGGAGATAT ACATATGAGC GAGCGAGCAC AC 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AAATAATTTT GTTTAACCTT AAGAAGGAGA TCTACATATG CTGGAGTGGC GTCAC 

(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGAGATCTAC ATATGGGAGA TGCTGGAGTG 



(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

GTAGCGAACG TCGAGAA 

(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

GGGGGATCCT CAGTTGTCAG TGGCGGGGTA G 

(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
GGGGATCCCT AATTGGCACG AGCGGCGC 



(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
.(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ. ID NO:21 

AATTACATAT GGCGGCCGCC GTTTCTGCC 

(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
AATTACATAT GTTCGCGGCC GCCGTTTCT 
(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Phe Ala Ala Ala Val Ser Ala Phe Ala Ala Asn Met Leu Ser Ser Val 
15 10 15 

Leu Lys Ser 



(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Pro Thr Leu Val Asp Gin Gly Phe Trp He Gly Gly Gin Tyr Ala Leu 
1 5 10 15 - 

Thr Pro Thr Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Phe Ala Ala Ala Val Ser 
1 5 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GCGCCCCCUG GGAUACCAGG AUC 23 

(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



TCAGCAGGTG GCATAGG 



(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 6.. 32 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

CCCAT ATG GGC GAT GCC GGC GTC GCG TCA CAG 
Met Gly Asp Ala Gly Val Ala Ser Gin 
1 5 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Gly Asp Ala Gly Val Ala Ser Gin 
1 5 



(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 6.. 32 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CCCAT ATG AGC GAG GCC GGC GTC GCG TCA CAG 
Met Ser Glu Ala Gly Val Ala Ser Gin 
1 5 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein - N-terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Ser Glu Ala Gly Val Ala Ser Gin 
1 5 



(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..27 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

ATG GGA GAT GCT GGA GTG GCG TCA CAG 
Met Gly Asp Ala Gly Val Ala Ser Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

Met Gly Asp Ala Gly Val Ala Ser Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

GGGGGATCCC GCGGATTTAT GAG C GAG 

(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GGGGGATCCC GCGGAGACAT GAGCGAGCAC AC 

(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

GGGGGATCCA GCGACATGAG AGATGCTGGA GTGG 34 

(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
'(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

GGGGGATCCA GCGACATGAG AGATGCTGGA GTGG 34 



(2) INFORMATION FOR SEQ ID NO: 38: 
(i.) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

GGGGGATCCG TTCTGCCTCC CCGGAC 2 6 

(2) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

. (A) NAME /KEY : CDS 

(B) LOCATION : 37.. 5145 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

GTTCTGCCTC CCCCGGACGG TAAATATAGG GGAACA ATG TAC GCG AAA GCG ACA 54 



Met Tyr Ala Lys Ala Thr 
1 5 



GAC 


GTG 


GCG 


CGT 


GTC 


TAC 


GCC 


GCG 


GCA 


GAT 


GTC GCC 


TAC 


GCG 


AAC 


GTA 


102 


Asp 


Val 


Ala 


Arg 


Val 


Tyr 


Ala 


Ala 


Ala 


Asp 


Val Ala 


Tyr Ala 


Asn 


Val 










10 










15 








20 








CTG 


CAG 


CAG 


AGA 


GCA 


GTC 


AAG 


TTG 


GAC 


TTC 


GCC CCG 


CCA 


CTG 


AAG 


GCA 


150 


Leu 


Gin 


Gin 


Arg 


Ala 


Val 


Lys 


Leu 


Asp 


Phe 


Ala Pro 


Pro 


Leu 


Lys 


Ala 








25 










30 








35 








CTA 


GAA 


ACC 


CTC 


CAC 


AGA 


CTG 


TAC 


TAT 


CCG 


CTG CGC 


TTC 


AAA 


GGG 


GGC 


198 


Leu 


Glu 


Thr 


Leu 


His 


Arg 


Leu 


Tyr 


Tyr 


Pro 


Leu Arg 


Phe 


Lys 


Gly Gly 






40 










45 








50 












ACT 


TTA 


CCC 


CCG 


ACA 


CAA 


CAC 


CCG 


ATC 


CTG 


GCC GGG 


CAC 


CAA 


CGT 


GTC 


246 


Thr 


Leu 


Pro 


Pro 


Thr 


Gin 


His 


Pro 


He 


Leu 


Ala Gly 


His 


Gin 


Arg 


Val 




55 










60 










65 






70 




GCA 


GAA 


GAG 


GTT 


CTG 


CAC 


AAT 


TTC 


GCC 


AGG 


GGA CGT 


AGC 


ACA 


GTG 


CTC 


294 


Ala 


Glu 


Glu 


Val 


Leu 
75 


His 


Asn 


Phe 


Ala 


Arg 
80 


Gly Arg 


Ser 


Thr 


Val 
85 


Leu 
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GAG 


ATA 


GGG 


CCG 


TCT 


CTG 


Glu 


He 


Gly 


Pro 


Ser 


Leu 








90 






AAC 


GCC 


CCC 


GTC 


GCA 


GAC 


Asn 


Ala 


Pro 


Val 


Ala 


Asp 






105 








GAC 


GGC 


TCG 


CGA 


CAC 


ATT 


Asp 


Gly 


Ser 


Arg 


His 


He 




120 










GGC 


CGG 


CCC 


GAG 


TTC 


AAG 


Gly Arg 


Pro 


Glu 


Phe 


Lys 


135 










140 


GCC 


TCC 


CGC 


ACC 


TTC 


TGC 


Ala 


Ser 


Arg 


Thr 


Phe 


Cys 










155 




TCG 


CGC 


GTT 


GGA 


ATT 


GCC 


Ser 


Arg 


Val 


Gly 


He 


Ala 








170 






GAG 


CTG 


GCC 


AAT 


GCG 


TTT 


Glu 


Leu 


Ala 


Asn 


Ala 


Phe 






185 








TTC 


ATG 


CAC 


ATG 


CCA 


GAA 


Phe 


Met 


His 


Met 


Pro 


Glu 




200 










GCC 


GAG 


CTC 


GGC 


TAC 


CGC 


Ala 


Glu 


Leu 


Gly 


Tyr Arg 


215 










220 


AAG 


GAC 


TGC 


GCA 


TTC 


CAG 


Lys 


Asp 


Cys 


Ala 


Phe 


Gin 










235 




TTG 


GAC 


TTC 


ATC 


AAC 


GAG 


Leu 


Asp 


Phe 


He 


Asn 


Glu 








250 






GCC 


CGC 


GGC 


TCC 


TAC 


TCC 


Ala 


Arg 


Gly 


Ser 


Tyr 


Ser 






265 








GAC 


TGG 


GGT 


GAT 


GCG 


TAC 


Asp 


Trp 


Gly 


Asp 


Ala 


Tyr 




280 










CTA 


CTG 


GTG 


AGG 


AAC 


TAC 


Leu 


Leu 


Val 


Arg 


Asn 


Tyr 


295 










300 


GAA 


GTC 


CAG 


AGG 


CGC 


CAC 


Glu 


Val 


Gin 


Arg 


Arg 


His 










315 




GCG 


CCA 


CCT 


GGA 


GAC 


CGC 


Ala 


Pro 


Pro 


Gly 


Asp Arg 








330 






GGC 


CTC 


TGC 


AGA 


ATC 


CCA 


Gly Leu 


Cys 


Arg 


He 


Pro 






345 








ACT 


GAG 


CAT 


AAG 


ACC 


ATC 


Thr 


Glu 


His 


Lys 


Thr 


He 




360 










CTC 


AAT 


TTT 


ATG 


CAA 


ACG 


Leu 


Asn 


Phe 


Met 


Gin 


Thr 


375 










380 



CAC 


AGC 


GCA 


CTT 


AAG 


CTA 


His 


Ser 


Ala 


Leu 


Lys 


Leu 






95 








TAT 


CAC 


GGG 


TGC 


ACC 


AAG 


Tyr 


His 


Gly Cys 


Thr 


Lys 




110 










ACG 


GCC 


TTA 


GAG 


TCT 


AGA 


Thr 


Ala 


Leu 


Glu 


Ser 


Arg 


125 










130 


GCC 


GAC 


GCC 


TCA 


CTG 


CTC 


Ala Asp Ala 


Ser 


Leu 


Leu 










145 




GTC 


GAC 


GGA 


GTC 


GGC 


TCT 


Val 


Asp Gly Val 


Gly Ser 








160 






AAT 


CAC 


TCC 


CTC 


TAT 


GAC 


Asn 


His 


Ser 


Leu 


Tyr 


Asp 






175 








GAG 


AAC 


CAC 


GGA 


CTT 


CAC 


Glu 


Asn 


His 


Gly 


Leu 


His 




190 










GAG 


CTG 


CTC 


TAC 


ATG 


GAC 


Glu 


Leu 


Leu 


Tyr 


Met 


Asp 


205 










210 


TTC 


CAC 


GTT 


ATT 


GAA 


GAG 


Phe" 


His 


Val 


He 


Glu 


Glu 










225 




GGG 


GGG 


GAC 


CTC 


CGT 


CTC 


Gly Gly Asp Leu 


Arg 


Leu 








240 






AGC 


CAA 


GAG 


CGG 


CGC 


ATC 


Ser 


Gin 


Glu 


Arg 


Arg 


He 






255 








AGA 


CGC 


GCC 


GTC 


ATT 


TTC 


Arg 


Arg 


Ala 


Val 


He 


Phe 




270 










TTA 


CAC 


GAC 


TTC 


CAC 


ACA 


Leu 


His 


Asp 


Phe 


His 


Thr 


285 










290 


CCC 


ACT 


CCG 


TTT 


GGT 


TTC 


Pro 


Thr 


Pro 


Phe 


Gly 


Phe 










305 




GGC 


TCC 


AGC 


ATT 


GAG 


CTG 


Gly 


Ser 


Ser 


He 


Glu 


Leu 








320 






ATG 


CTG 


GCC 


GTC 


GTC 


CCA 


Met 


Leu 


Ala 


Val 


Val 


Pro 






335 








AAC 


ATC 


TTT 


TAT 


TAC 


GCC 


Asn 


He 


Phe 


Tyr 


Tyr Ala 




350 










CTT 


ACG 


TCA 


CAG 


CAC 


AAA 


Leu 


Thr 


Ser 


Gin 


His 


Lys 


365 










370 


CGT 


CCT 


GAG 


AAG 


GAA 


CTA 


Arg 


Pro 


Glu 


Lys 


Glu 


Leu 










385 





CAT 


GGG 


GCA 


CCG 


342 


His 


Gly Ala 


Pro 






100 








TAC 


GGC 


ACC 


CGC 


390 


Tyr 


Gly Thr Arg 




115 










TCC 


GTC 


GCC 


ACA 


438 


Ser 


Val 


Ala 


Thr 




GCC 


AAC 


GGC 


ATT 


486 


Ala 


Asn 


Gly 


He 










150 




TGC 


GCG 


TTC 


AAA 


534 


Cys 


Ala 


Phe 


Lys 








165 






GTG 


ACC 


CTA 


GAG 


582 


Val 


Thr 


Leu 


Glu 






180 








ATG 


GTC 


CGC 


GCG 


630 


Met 


Val Arg Ala 




195 










AAC 


GTG 


GTT 


AAT 


678 


Asn 


Val 


Val 


Asn 




CCT 


ATG 


GCT 


GTG 


726 


Pro 


Met 


Ala 


Val 










230 




CAC 


TTC 


CCT 


GAG 


774 


His 


Phe 


Pro 


Glu 








245 






GAG 


AGG 


CTG 


GCC 


822 


Glu 


Arg 


Leu 


Ala 






260 








TCC 


GGC 


GAC 


GAC 


870 


Ser 


Gly Asp Asp 




275 










TGG 


CTC 


GCC 


TAC 


918 


Trp 


Leu 


Ala 


Tyr 




TCA 


CTC 


CAT 


ATA 


966 


Ser 


Leu 


His 


He 










310 




CGC 


ATC 


ACT 


CGC 


1014 


Arg 


He 


Thr Arg 








325 






AGG 


ACG 


TCC 


CAA 


1062 


Arg 


Thr 


Ser 


Gin 






340 








GAC 


GCG 


TCG 


GGC 


1110 


Asp 


Ala 


Ser Gly 




355 










GTC 


AAC 


ATG 


CTG 


1158 


Val 


Asn 


Met 


Leu 




GTC 


GAC 


ATG 


ACC 


1206 


Val 


Asp Met 


Thr 





390 
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GTC TTG ATG TCG 
Val Leu Met Ser 

TCA GAA GTC ACC 
Ser Glu Val Thr 
410 

CGC ACT GTC GTG 
Arg Thr Val Val 
425 

GCT GCG GTC GCT 
Ala Ala Val Ala 
440 

TCG TTC TGG GAG 
Ser Phe Trp Glu 
455 

CGC AAC CTC AAA 
Arg Asn Leu Lys 

AAG TAC CGA GTC 
Lys Tyr Arg Val 
490 

TCC CCT GAA CAG 
Ser Pro Glu Gin 
505 

GTC TTT CAC GAC 
Val Phe His Asp 
520 

TAC AAC GAA CGT 
Tyr Asn Glu Arg 
535 

GGT TTC GAC GCC 
Gly Phe Asp Ala 

TAC CGC GCC ACC 
Tyr Arg Ala Thr 
570 

AAG ATC ACC CTC 
Lys lie Thr Leu 
585 

TCC CCA GTC AGA 
Ser Pro Val Arg 
600 

AAG ACG GGC ACC 
Lys Thr Gly Thr 
615 

CTT TAC GTG GCA 
Leu Tyr Val Ala 

ATC AAA CCG CCG 
lie Lys Pro Pro 
650 

CTC CGT CGT GCC 
Leu Arg Arg Ala 
665 

GAC GAG TGC TTC 
Asp Glu Cys Phe 
680 



TTC GCG CGC GCT 
Phe Ala Arg Ala 
395 

GAG AGC TCC TGG 
Glu Ser Ser Trp 

TCT CTT TAC GTC 
Ser Leu Tyr Val 
430 

GTC AAG ACC GCC 
Val Lys Thr Ala 
445 

AGT CTC AAG CAC 
Ser Leu Lys His 
460 

GGC ACC GAC GTC 
Gly Thr Asp Val 
475 

CAC TCG CTC GGA 
His Ser Leu Gly 

GTC GGC TTC CTG 
Val Gly Phe Leu 
510 

AGG GAA GAG CTT 
Arg Glu Glu Leu 
525 

CCG GTA CCT TCC 
Pro Val Pro Ser 
540 

GAC TTG TGG CAC 
Asp Leu Trp His 
555 

TTG CAG GCA GGT 
Leu Gin Ala Gly 

GAG AAC GCC CTC 
Glu Asn Ala Leu 
590 

GGC CTC GAG ATG 
Gly Leu Glu Met 
605 

CTC ATC GCC GCC 
Leu lie Ala Ala 
620 

CCC ACC AGA GAA 
Pro Thr Arg Glu 
635 

TCC GCC TCG GCT 
Ser Ala Ser Ala 

ACC GCC GAG GGC 
Thr Ala Glu Gly 
670 

ATG TTC CCG CTC 
Met Phe Pro Leu 
685 



AGG CTG CGC GCG 
Arg Leu Arg Ala 
400 

AAC ATC TCA CCG 
Asn lie Ser Pro 
415 

CTC CAC ATC ATC 
Leu His He He 

AAG GAC GAC GTC 
Lys Asp Asp Val 
. 450 

GTC TTG GGC TCC 
Val Leu Gly Ser 
465 

GTC TTT ACT AAG 
Val Phe Thr Lys 
480 

GAC ATA ATC TGC 
Asp He He Cys 
495 

CCG TCC CGC GTA 
Pro Ser Arg Val 

GAG GTC CTT CGC 
Glu Val Leu. Arg 
530 

ACT CCT CCT GTG 
Thr Pro Pro Val 
545 

GCG ACC GCG GCC 
Ala Thr Ala Ala 
560 

CTC AAC ACC GAC 
Leu Asn Thr Asp 
575 

AAG ACC ATC GAC 
Lys Thr He Asp 

TAC GAG GGC CCG 
Tyr Glu Gly Pro 
610 

CTT GAG GCC GCG 
Leu Glu Ala Ala 
625 

CTG AGA GAG GCT 
Leu Arg Glu Ala 
640 

ACG CAA CAT GTC 
Thr Gin His Val 
655 

GCC CCT TTC GCT 
Ala Pro Phe Ala 

GTG TAC GTC GCG 
Val Tyr Val Ala 
690 



ATC GTG GTC GCC 
He Val Val Ala 
405 

GCT GAC CTG GTC 
Ala Asp Leu Val 
420 

GAG CGC CGA AGG 
Glu Arg Arg Arg 
435 

TTT GGA GAG ACT 
Phe Gly Glu Thr 

TGT TGC GGT CTG 
Cys Cys Gly Leu 
470 

CGC GTC GTC GAT 
Arg Val Val Asp 
4 85 

GAC GTC CGC CTG 
Asp Val Arg Leu 
500 

CCA CCT GCC CGC 
Pro Pro Ala Arg 
515 

GAA GCT GGC TGC 
Glu Ala Gly Cys 

GAG GAG CCC CAA 
Glu Glu Pro Gin 
550 

TCA CTC CCC GAG 
Ser Leu Pro Glu 
565 

GTC AAG CAG CTC 
Val Lys Gin Leu 
580 

GGG CTC ACC CTC 
Gly Leu Thr Leu 
595 

CCA GGC AGC GGC 
Pro Gly Ser Gly 

GGC GGT AAA GCA 
Gly Gly Lys Ala 
630 

ATG GAC CGG CGG 
Met Asp Arg Arg 
645 

GCC CTT GCG ATT 
Ala Leu Ala He 
660 

ACC GTG GTT ATC 
Thr Val Val He 
675 

ATC GTG CAC GCC 
He Val His Ala 
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TTG TCC CCG AGC TCA CGA ATA GTC CTT GTA GGG GAC GTC CAC CAA ATC 2166 

Leu Ser Pro Ser Ser Arg He Val Leu Val Gly Asp Val His Gin He 

695 700 705 710 

GGG TTT ATA GAC TTC CAA GGC ACA AGC GCG AAC ATG CCG CTC GTT CGC 2214 

Gly Phe He Asp Phe Gin Gly Thr Ser Ala Asn Met Pro Leu Val Arg 

715 720 725 

GAC GTC GTT AAG CAG TGC CGT CGG CGC ACT TTC AAC CAA ACC AAG CGC 2262 
Asp Val Val Lys Gin Cys Arg Arg Arg Thr Phe Asn Gin Thr Lys Arg 

730 735 740 

TGT CCG GCC GAC GTC GTT GCC ACC ACG TTT TTC CAG AGC TTG TAC CCC 2310 
Cys Pro Ala Asp Val Val Ala Thr Thr Phe Phe Gin Ser Leu Tyr Pro 

745 750 755 

GGG TGC ACA ACC ACC TCA GGG TGC GTC GCA TCC ATC AGC CAC GTC GCC 2358 
Gly Cys Thr Thr Thr Ser Gly Cys Val Ala Ser He Ser His Val Ala 

760 765 770 

CCA GAC TAC CGC AAC AGC CAG GCG CAA ACG CTC TGC TTC ACG CAG GAG 24 06 

Pro Asp Tyr Arg Asn Ser Gin Ala Gin Thr Leu Cys Phe Thr Gin Glu 
775 780 785 790 

GAA AAG TCG CGC CAC GGG GCT GAG GGC GCG ATG ACT GTG CAC GAA GCG 24 54 

Glu Lys Ser Arg His Gly Ala Glu Gly Ala Met Thr Val' His Glu Ala 

795 800 805 

CAG GGA CGC ACT TTT GCG TCT GTC ATT CTG CAT TAC AAC GGC TCC ACA 2502 
Gin Gly Arg Thr Phe Ala Ser Val He Leu His Tyr Asn Gly Ser Thr 

810 815 820 

GCA GAG CAG AAG CTC CTC GCT GAG AAG TCG CAC CTT CTA GTC GGC ATC 2550 
Ala Glu Gin Lys Leu Leu Ala- Glu Lys Ser His Leu Leu Val Gly He 

825 830 835 

ACG CGC CAC ACC AAC CAC CTG TAC ATC CGC GAC CCG ACA GGT GAC ATT 2598 
Thr Arg His Thr Asn His Leu Tyr He Arg Asp Pro Thr Gly Asp He 

840 845 850 

GAG AGA CAA CTC AAC CAT AGC GCG AAA GCC GAG GTG TTT ACA GAC ATC 264 6 

Glu Arg Gin Leu Asn His Ser Ala Lys Ala Glu Val Phe Thr Asp He 
855 860 865 870 

CCT GCA CCC CTG GAG ATC ACG ACT GTC AAA CCG AGT GAA GAG GTG CAG 2 694 

Pro Ala Pro Leu Glu He Thr Thr Val Lys Pro Ser Glu Glu Val Gin 

875 880 885 

CGC AAC GAA GTG ATG GCA ACG ATA CCC CCG CAG AGT GCC ACG CCG CAC 2742 
Arg Asn Glu Val Met Ala Thr He Pro Pro Gin Ser Ala Thr Pro His 

890 895 900 

GGA GCA ATC CAT CTG CTC CGC AAG AAC TTC GGG GAC CAA CCC GAC TGT 27 90 

Gly Ala He His Leu Leu Arg Lys Asn Phe Gly Asp Gin Pro Asp Cys 

905 910 915 

GGC TGT GTC GCT TTG GCG AAG ACC GGC TAC GAG GTG TTT GGC GGT CGT 2838 
Gly Cys Val Ala Leu Ala Lys Thr Gly Tyr Glu Val Phe Gly Gly Arg 

920 925 930 

GCC AAA ATC AAC GTA GAG CTT GCC GAA CCC GAC GCG ACC CCG AAG CCG 2886 
Ala Lys He Asn Val Glu Leu Ala Glu Pro Asp Ala Thr Pro Lys Pro 
935 940 945 950 

CAT AGG GCG TTC CAG GAA GGG GTA CAG TGG GTC AAG GTC ACC AAC GCG 2934 
His Arg Ala Phe Gin Glu Gly Val Gin Trp Val Lys Val Thr Asn Ala 

955 960 965 

TCT AAC AAA CAC CAG GCG CTC CAG ACG CTG TTG TCC CGC TAC ACC AAG 2982 
Ser Asn Lys His Gin Ala Leu Gin Thr Leu Leu Ser Arg Tyr Thr Lys 

970 975 980 

CGA AGC GCT GAC CTG CCG CTA CAC GAA GCT AAG GAG GAC GTC AAA CGC 3030 
Arg Ser Ala Asp Leu Pro Leu His Glu Ala Lys Glu Asp Val Lys Arg 
985 990 995 
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ATG 


CTA 


AAC 


TCG 


CTT 


GAC 


CGA 


CAT 


TGG 


GAC 


TGG 


ACT 


GTC 


ACT 


GAA 


GAC 


3078 


Met 


Leu 


Asn 


Ser 


Leu 


Asp Arg 


His 


Trp Asp 


Trp 


Thr 


Val 


Thr 


Glu 


Asp 






1000 








1005 








1010 








GCC 


CGT 


GAC 


CGA 


GCT 


GTC 


TTC 


GAG 


ACC 


CAG 


CTC 


AAG 


TTC 


ACC 


CAA 


CGC 


3126 


Ala 


Arg 


Asp 


Arg Ala 


Val 


Phe 


Glu 


Thr 


Gin 


Leu 


Lys 


Phe 


Thr 


Gin 


Arg 




1015 








1020 








1025 








1030 




GGC 


GGC 


ACC 


GTC 


GAA 


GAC 


CTG 


CTG 


GAG 


CCA 


GAC 


GAC 


CCC 


TAC 


ATC 


CGT 


3174 


Gly Gly 


Thr 


Val 


Glu 


Asp 


Leu 


Leu 


Glu 


Pro 


Asp 


Asp 


Pro 


Tyr 


He 


Arg 












1035 








1040 








1045 




GAC 


ATA 


GAC 


TTC 


CTT 


ATG 


AAG 


ACT 


CAG 


CAG 


AAA 


GTG 


TCG 


CCC 


AAG 


CCG 


3222 


Asp 


He 


Asp 


Phe 


Leu 


Met 


Lys 


Thr 


Gin 


Gin 


Lys 


Val 


Ser 


Pro 


Lys 


Pro 










1050 








1055 








1060 






ATC 


AAT 


ACG 


GGC 


AAG 


GTC 


GGG 


CAG 


GGG 


ATC 


GCC 


GCT 


CAC 


TCA 


AAG 


TCT 


3270 


He 


Asn 


Thr 


Gly 


Lys 


Val 


Gly 


Gin 


Gly 


He 


Ala 


Ala 


His 


Ser 


Lys 


Ser 








1065 








1070 








1075 






CTC 


AAC 


TTC 


GTC 


CTC 


GCC 


GCT 


TGG 


ATA 


CGC 


ATA 


CTC 


GAG 


GAG 


ATA 


CTC 


3318 


Leu 


Asn 


Phe 


Val 


Leu 


Ala 


Ala 


Trp 


lie 


Arg 


He 


Leu 


Glu 


Glu 


He 


Leu 






1080 








1085 








1090 










CGT 


ACC 


GGG 


AGC 


CGC 


ACG 


GTC 


CGG 


TAC 


AGC 


AAC 


GGT 


CTC 


CCC 


GAC 


GAA 


3366 


Arg 


Thr 


Gly 


Ser 


Arg 


Thr 


Val 


Arg 


Tyr 


Ser 


Asn 


Gly 


Leu 


Pro 


Asp 


Glu 




1095 








1100 








1105 






1110 




GAA 


GAG 


GCC 


ATG 


CTG 


CTC 


GAA 


GCG 


AAG 


ATC 


AAT 


CAA 


GTC 


CCA 


CAC 


GCC 


3414 


Glu 


Glu 


Ala 


Met 


Leu 


Leu 


Glu 


Ala 


Lys 


He 


Asn 


Gin 


Val 


Pro 


His 


Ala 












1115 








1120 








1125 




ACG 


TTC 


GTC 


TCG 


GCG 


GAC 


TGG 


ACC 


GAG 


TTT 


GAC 


ACC 


GCC 


CAC 


AAT 


AAC 


3462 


Thr 


Phe 


Val 


Ser 


Ala 


Asp 


Trp- 


Thr 


Glu 


Phe 


Asp 


Thr 


Ala 


His 


Asn 


Asn 










1130 








1135 








1140 






ACG 


AGT 


GAG 


CTG 


CTC 


TTC 


GCC 


GCC 


CTT 


TTA 


GAG 


CGC 


ATC 


GGC 


ACG 


CCT 


3510 


Thr 


Ser 


Glu 


Leu 


Leu 


Phe 


Ala 


Ala 


Leu 


Leu 


Glu 


Arg 


He 


Gly Thr 


Pro 








1145 








1150 








1155 








GCA 


GCT 


GCC 


GTT 


AAT 


CTA 


TTC 


AGA 


GAA 


CGG 


TGT 


GGG 


AAA 


CGC 


ACC 


TTG 


3558 


Ala 


Ala 


Ala 


Val 


Asn 


Leu 


Phe 


Arg 


Glu 


Arg 


Cys 


Gly 


Lys 


Arg 


Thr 


Leu 






1160 








1165 








1170 










CGA 


GCG 


AAG 


GGT 


CTA 


GGC 


TCC 


GTT 


GAA 


GTC 


GAC 


GGT 


CTG 


CTC 


GAC 


TCC 


3606 


Arg 


Ala 


Lys 


Gly 


Leu 


Gly 


Ser 


Val 


Glu 


Val 


Asp 


Gly 


Leu 


Leu 


Asp 


Ser 




1175 








1180 








1185 






1190 




GGC 


GCA 


GCT 


.TGG 


ACG 


CCT 


TGC 


CGC 


AAC 


ACC 


ATC 


TTC 


TCT 


GCC 


GCC 


GTC 


3654 


Gly Ala 


Ala 


Trp 


Thr 


Pro 


Cys 


Arg 


Asn 


Thr 


He 


Phe 


Ser 


Ala 


Ala 


Val 












1195 








1200 








1205 




ATG 


CTC 


ACG 


CTC 


TTC 


CGC 


GGC 


GTC 


AAG 


TTC 


GCA 


GCT 


TTC 


AAA 


GGC 


GAC 


3702 


Met 


Leu 


Thr 


Leu 


Phe 


Arg 


Gly 


Val 


Lys 


Phe 


Ala 


Ala 


Phe 


Lys 


Gly 

1 


Asp 










1210 








1215 








122C 




GAC 


TCG 


CTC 


CTC 


TGT 


GGT 


AGC 


CAT 


TAC 


CTC 


CGT 


TTC 


GAC 


GCT 


AGC 


CGC 


3750 


Asp 


Ser 


Leu 


Leu 


Cys 


Gly 


Ser 


His 


Tyr 


Leu 


Arg 


Phe 


Asp 


Ala 


Ser 


Arg 








1225 








1230 








1235 






CTT 


CAC 


ATG 


GGC 


GAA 


CGT 


TAC 


AAG 


ACC 


AAA 


CAT 


TTG 


AAG 


GTC 


GAG 


GTG 


3798 


Leu 


His 


Met 


Gly Glu 


Arg 


Tyr 


Lys 


Thr 


Lys 


His 


Leu 


Lys 

l 


Val 


Glu 


Val 






1240 








1245 








125C 










CAG 


AAA 


ATC 


GTG 


CCG 


TAC 


ATC 


GGA 


CTC 


CTC 


GTC 


TCC 


GCT 


GAG 


CAG 


GTC 


3846 


Gin 


Lys 


He 


Val 


Pro 


Tyr 


He 


Gly 


Leu 


Leu 


Val 


Ser 


Ala 


Glu 


Gin 


Val 




1255 








1260 








1265 








1270 




GTC 


CTC 


GAC 


CCT 


GTC 


AGG 


AGC 


GCT 


CTC 


AAG 


ATA 


TTT 


GGG 


CGC 


TGC 


TAC 


3894 


Val 


Leu Asp 


Pro 


Val 


Arg 


Ser 


Ala 


Leu 


Lys 


He 


Phe 


Gly 


Arg 


Cys 


Tyr 












1275 








1280 






1285 




ACA 


AGC 


GAA 


CTC 


CTT 


TAC 


TCC 


AAG 


TAC 


GTG 


GAG 


GCT 


GTG 


AGA 


GAC 


ATC 


3942 


Thr 


Ser 


Glu 


Leu 


Leu 


Tyr 


Ser 


Lys 


Tyr 


Val 


Glu 


Ala 


Val 


Arg 


Asp 


He 










1290 










1295 








1300 
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ACC 


AAG 


GGC 


TGG 


AGT 


GAC 


GCC 


CGC 


TAC 


CAC 


AGC 


CTC 


CTG 


TGC 


CAC 


ATG 


3990 


Thr 


Lys 


Gly Trp 


Ser 


Asp 


Ala 


Arg 


Tyr 


His 


Ser 


Leu 


Leu 


Cys 


His 


Met 








1305 








1310 








1315 








TCA 


GCA 


TGC 


TAC 


TAC 


AAT 


TAC 


GCG 


CCG 


GAG 


TCT 


GCG 


GCG 


TAC 


ATC 


ATC 


4038 


Ser 


Ala 


Cys 


Tyr 


Tyr 


Asn 


Tyr Ala 


Pro 


Glu 


Ser 


Ala 


Ala 


Tyr 


He 


He 






1320 








1325 








1330 










GAC 


GCT 


GTT 


GTT 


CGC 


TTT 


GGG 


CGC 


GGC 


GAC 


TTC 


CCG 


TTT 


GAA 


CAA 


CTG 


4086 


Asp Ala 


Val 


Val 


Arg 


Phe 


Gly 


Arg 


Gly 


Asp 


Phe 


Pro 


Phe 


Glu 


Gin 


Leu 




1335 








1340 








1345 








1350 




CGC 


GTG 


GTG 


CGT 


GCC 


CAT 


GTG 


CAG 


GCA 


CCC 


GAC 


GCT 


TAC 


AGC 


AGC 


ACG 


4134 


Arg 


Val 


Val 


Arg 


Ala 


His 


Val 


Gin 


Ala 


Pro Asp Ala 


Tyr 


Ser 


Ser 


Thr 












1355 








1360 








1365 




TAT 


CCG 


GCT 


AAC 


GTG 


CGC 


GCA 


TCG 


TGC 


CTT 


GAC 


CAC 


GTC 


TTC 


GAG 


CCC 


4182 


Tyr 


Pro 


Ala 


Asn 


Val 


Arg 


Ala 


Ser 


Cys 


Leu 


Asp 


His 


Val 


Phe 


Glu 


Pro 










1370 








1375 








1380 






CGC 


CAG 


GCC 


GCC 


GCC 


CCG 


GCA 


GGT 


TTC 


GTT 


GCG 


ACA 


TGT 


GCG 


AAG 


CCG 


4230 


Arg 


Gin 


Ala 


Ala 


Ala 


Pro 


Ala 


Gly 


Phe 


Val 


Ala 


Thr 


Cys 


Ala 


Lys 


Pro 








1385 








1390 








1395 








GAA 


ACG 


CCT 


TCT 


TCA 


CTT 


ACC 


GCG 


AAA 


GCT 


GGT 


GTT 


TCT 


GCG 


ACT 


ACA 


4278 


Glu 


Thr 


Pro 


Ser 


Ser 


Leu 


Thr 


Ala 


Lys 


Ala 


Gly 


Val 


Ser 


'Ala 


Thr 


Thr 






1400 








1405 








1410 










AGC 


CAC 


GTT 


GCG 


ACT 


GGG 


ACT 


GCG 


CCC 


CCG 


GAG 


TCT 


CCA 


TGG 


GAT 


GCA 


4326 


Ser 


His 


Val 


Ala 


Thr 


Gly 


Thr 


Ala 


Pro 


Pro 


Glu 


Ser 


Pro 


Trp 


Asp 


Ala 




1415 








1420 








1425 








1430 




CCT 


GCA 


GCC 


AAC 


AGC 


TTT 


TCG 


GAG 


TTA 


TTG 


ACA 


CCG 


GAG 


ACC 


CCG 


TCC 


4374 


Pro 


Ala 


Ala 


Asn 


Ser 


Phe 


Ser 


Glu 


Leu 


Leu 


Thr 


Pro 


Glu 


Thr 


Pro 


Ser 












1435 








1440 








1445 




ACA 


TCA 


TCC 


TCG 


CCG 


TCA 


TCG 


TCT 


TCA 


TCG 


GAC 


TCC 


TCT 


ACA 


TCG 


TGT 


4422 


Thr 


Ser 


Ser 


Ser 


Pro 


Ser 


Ser 


Ser 


Ser 


Ser 


Asp 


Ser 


Ser 


Thr 


Ser 


Cys 










1450 








1455 








1460 






GGA 


AGG 


TCG 


CTC 


AGT 


GGT 


GGA 


GAC 


ACC 


GCA 


AGG 


ACC 


ACA 


GAA 


GAC 


TTG 


4470 


Gly Arg 


Ser 


Leu 


Ser 


Gly 


Gly Asp 


Thr 


Ala 


Arg 


Thr 


Thr 


Glu 


Asp 


Leu 








1465 








1470 








1475 








AAC 


AGC 


AGA 


AAG 


CCG 


CCT 


TCG 


CAA 


GAC 


AGG 


CAA 


TCA 


CGC 


TCG 


TCT 


GAA 


4518 


Asn 


Ser 


Arg 


Lys 


Pro 


Pro 


Ser 


Gin 


Asp 


Arg 


Gin 


Ser 


Arg 


Ser 


Ser 


Glu 






1480 








1485 








1490 










TGT 


CTG 


GAC 


AGA 


AGC 


GGA 


GAA 


AGG 


ACA 


GGC 


AGT 


TCG 


TTA 


ACT 


GCC 


CCC 


4566 


Cys 


Leu 


Asp 


Arg 


Ser 


Gly 


Glu 


Arg 


Thr 


Gly 


Ser 


Ser 


Leu 


Thr 


Ala 


Pro 




1495 








1500 








1505 








1510 




ACT 


GCT 


CCG 


AGC 


CCC 


TCA 


TTC 


TCA 


TTT 


TCG 


GAA 


AGA 


GCT 


CGA 


CTG 


GCG 


4614 


Thr 


Ala 


Pro 


Ser 


Pro 


Ser 


Phe 


Ser 


Phe 


Ser 


Glu 


Arg 


Ala 


Arg 


Leu 


Ala 












1515 








1520 








1525 




ACC 


GGG 


CCG 


ACT 


GTC 


GCC 


GCT 


GCG 


ACA 


TCA 


CCT 


TCG 


GCA 


ACC 


CCA 


TCC 


4662 


Thr 


Gly 


Pro 


Thr 


Val 


Ala 


Ala 


Ala 


Thr 


Ser 


Pro 


Ser 


Ala 


Thr 


Pro 


Ser 










1530 








1535 








1540 






TGC 


GCC 


ACG 


GAC 


CAG 


GTT 


GCC 


GCG 


AGG 


ACC 


ACG 


CCG 


GAC 


TTT 


GCG 


CCT 


4710 


Cys 


Ala 


Thr 


Asp 


Gin 


Val 


Ala 


Ala 


Arg 


Thr 


Thr 


Pro 


Asp 


Phe 


Ala 


Pro 








1545 








1550 








1555 








TTC 


CTG 


GGT 


TCC 


CAG 


TCT 


GCC 


CGT 


GCT 


GTC 


TCG 


AAG 


CCG 


TAC 


CGG 


CCC 


4758 


Phe 


Leu 


Gly 


Ser 


Gin 


Ser 


Ala 


Arg Ala 


Val 


Ser 


Lys 


Pro 


Tyr 


Arg 


Pro 






1560 








1565 








1570 






CCC 


ACG 


ACT 


GCC 


CGT 


TGG 


AAA 


GAA 


GTC 


ACC 


CCG 


CTC 


CAC 


GCG 


TGG 


AAG 


4806 


Pro 


Thr 


Thr 


Ala 


Arg 


Trp 


Lys 


Glu 


Val 


Thr 


Pro 


Leu 


His 


Ala 


Trp 


Lys 




1575 








1580 








1585 






1590 




GGC 


GTG 


ACC 


GGA 


GAC 


CGA 


CCG 


GAA 


GTC 


AGG 


GAG 


GAC 


CCG 


GAG 


ACA 


GCG 


4854 


Gly Val 


Thr 


Gly 


Asp 


Arg 


Pro 


Glu 


Val 


Arg 


Glu 


Asp 


Pro 


Glu 


Thr 


Ala 












1595 








1600 








1605 
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GCG 


GTC 


GTC 


CAG 


GCT 


CTG 


ATC 


AGC 


GGC 


CGT 


TAT 


CCT 


CAG 


AAG 


ACG 


AAG 


4902 


Ala 


Val 


Val 


Gin 


Ala 


Leu 


He 


Ser 


Gly Arg 


Tyr 


Pro 


Gin 


Lys 


Thr 


Lys 










1610 








1615 








1620 






CTT 


TCC 


TCC 


GAC 


GCA 


TCC 


AAA 


GGC 


TAC 


TCA 


AGA 


ACT 


AAG 


GGA 


TGC 


TCA 


4950 


Leu 


Ser 


Ser 


Asp 


Ala 


Ser 


Lys 


Gly Tyr 


Ser 


Arg 


Thr 


Lys 


Gly 


Cys 


Ser 








1625 








1630 








1635 








CAA 


TCC 


ACC 


TCT 


TTT 


CCT 


GCC 


CCG 


AGT 


GCG 


GAT 


TAC 


CAG 


GCC 


CGC 


GAC 


4998 


Gin 


Ser 


Thr 


Ser 


Phe 


Pro 


Ala 


Pro 


Ser 


Ala 


Asp 


Tyr 


Gin 


Ala 


Arg 


Asp 






1640 








1645 








1650 






TGC 


CAG 


ACA 


GTC 


CGA 


GTC 


TGC 


CGC 


GCC 


GCT 


GCA 


GAG 


ATG 


GCG 


CGC 


TCA 


5046 


Cys 


Gin 


Thr 


Val 


Arg 


Val 


Cys 


Arg 


Ala 


Ala 


Ala 


Glu 


Met 


Ala 


Arg 


Ser 




1655 








1660 








1665 






1670 




TGT 


ATT 


CAC 


GAG 


CCG 


TTG 


GCT 


TCA 


TCT 


GCC 


GCC 


AGT 


GCC 


GAC 


TTG 


AAG 


5094 


Cys 


lie 


His 


Glu 


Pro 


Leu 


Ala 


Ser 


Ser 


Ala 


Ala 


Ser 


Ala 


Asp 


Leu 


Lys 












1675 








1680 






1685 




CGC 


ATA 


CGC 


TCT 


ACC 


TCG 


GAC 


TCT 


GTT 


CCC 


GAT 


GTA 


AAG 


ATC 


AGC 


AAG 


5142 


Arg 


He 


Arg 


Ser 


Thr 


Ser 


Asp 


Ser 


Val 


Pro 


Asp 


Val 


Lys 


He 


Ser 


Lys 










1690 








1695 








1700 






AGC 


GCA 


TGAAGGAACA AAATTAGTTT CCTTGTTCGT 


' AAACAAGGTG 


GTCCCTCCCA 


5198 



Ser Ala 

TTGAGGTAAA GACTCTGGTG AGTCCTCAAC GTTACTCGTT GAGTCTGCTG CGGTTCGATT 5258 
CCATTCCCAA GCAGCAAAGG GTGCGCAACT AGTACGGCGC CCCCTGGGAT ACCA 5312 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1704 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



Met 


Tyr 


Ala 


Lys 


Ala 


Thr 


Asp 


Val 


Ala 


Arg 


Val 


Tyr 


Ala 


Ala 


Ala 


Asp 


1 








5 










10 










15 


Val 


Ala 


Tyr 


Ala 


Asn 


Val 


Leu 


Gin 


Gin 


Arg 


Ala 


Val 


Lys 


Leu 


Asp 


Phe 








20 










25 










30 




Ala 


Pro 


Pro 


Leu 


Lys 


Ala 


Leu 


Glu 


Thr 


Leu 


His 


Arg 


Leu 


Tyr 


Tyr' 


Pro 






35 










40 










45 






Leu 


Arg 
50 


Phe 


Lys 


Gly 


Gly 


Thr 
55 


Leu 


Pro 


Pro 


Thr 


Gin 
60 


His 


Pro 


He 


Leu 


Ala 


Gly 


His 


Gin 


Arg 


Val 


Ala 


Glu 


Glu 


Val 


Leu 


His 


Asn 


Phe 


Ala 


Arg 


65 










70 










75 










80 


Gly 


Arg 


Ser 


Thr 


Val 
85 


Leu 


Glu 


He 


Gly 


Pro 
90 


Ser 


Leu 


His 


Ser 


Ala 
95 


Leu 


Lys 


Leu 


His 


Gly 


Ala 


Pro 


Asn 


Ala 


Pro 


Val 


Ala 


Asp 


Tyr 


His 


Gly 


Cys 








100 










105 










110 


Thr 


Lys 


Tyr 


Gly 


Thr 


Arg 


Asp 


Gly Ser 


Arg 


His 


He 


Thr 


Ala 


Leu 


Glu 






115 










120 










125 








Ser' 


Arg 


Ser 


Val 


Ala 


Thr 


Gly 


Arg 


Pro 


Glu 


Phe 


Lys 


Ala 


Asp 


Ala 


Ser 




130 










135 










140 








Leu 


Leu 


Ala 


Asn 


Gly 


He 


Ala 


Ser Arg 


Thr 


Phe 


Cys 


Val 


Asp 


Gly 


Val 


145 










150 










155 






160 


Gly 


Ser 


Cys 


Ala 


Phe 


Lys 


Ser 


Arg Val 


Gly 


He 


Ala 


Asn 


His 


Ser 


Leu 










165 










170 










175 




Tyr 


Asp 


Val 


Thr 


Leu 


Glu 


Glu 


Leu 


Ala 


Asn 


Ala 


Phe 


Glu 


Asn 


His 


Gly 








180 










185 










190 




Leu 


His 


Met 


Val 


Arg 


Ala 


Phe 


Met 


His 


Met 


Pro 


Glu 


Glu 


Leu 


Leu 


Tyr 
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195 

Met. Asp Asn Val 
210 

Glu Glu Pro Met 
225 

Arg Leu His Phe 

Arg lie Glu Arg 
260 

lie Phe Ser Gly 
275 

His Thr Trp Leu 
290 

Gly Phe Ser Leu 
305 

Glu Leu Arg lie 

Val Pro Arg Thr 
340 

Tyr Ala Asp Ala 
355 

His Lys Val Asn 
370 

Glu Leu Val Asp 
385 

Arg Ala He Val 

Ser Pro Ala Asp 
420 

He He Glu Arg 
435 

Asp Val Phe Gly 
450 

Gly Ser Cys Cys 
465 

Thr Lys Arg Val 

He Cys Asp Val 
500 

Arg Val Pro Pro 
515 

Leu Arg Glu Ala 
530 

Pro Val Glu Glu 
545 

Ala Ala Ser Leu 

Thr Asp Val Lys 
580 

He Asp Gly Leu 
595 

Gly Pro Pro Gly 
610 

Ala Ala Gly Gly 
625 

Glu Ala Met Asp 



Val Asn Ala 
215 

Ala Val Lys 

230 
Pro Glu Leu 
245 

Leu Ala Ala 

Asp Asp Asp 

Ala Tyr Leu 
295 

His He Glu 

310 
Thr Arg Ala 
325 

Ser Gin Gly 

Ser Gly Thr 

Met Leu Leu 
375 

Met Thr Val 
390 

Val Ala Ser- 
405 

Leu Val Arg 

Arg Arg Ala 

Glu Thr Ser 
455 

Gly Leu Arg 

470 
Val Asp Lys 
485 

Arg Leu Ser 

Ala Arg Val 

Gly Cys Tyr 
535 

Pro Gin Gly 

550 
Pro Glu Tyr 
565 

Gin Leu Lys 

Thr Leu Ser 

Ser Gly Lys 
615 

Lys Ala Leu 

630 
Arg Arg lie 
645 



200 205 
Glu Leu Gly Tyr Arg Phe His Val He 
220 

Asp Cys Ala Phe Gin Gly Gly Asp Leu 
235 240 
Asp Phe He Asn Glu Ser Gin Glu Arg 

250 255 
Arg Gly Ser Tyr Ser Arg Arg Ala Val 

265 270 
Trp Gly Asp Ala Tyr Leu His Asp Phe 
280 285 
Leu Val Arg Asn Tyr Pro Thr Pro Phe 
300 

Val Gin Arg Arg His Gly Ser Ser lie 
315 320 
Pro Pro Gly Asp Arg Met Leu Ala Val 

330 335 
Leu Cys Arg He Pro Asn He Phe Tyr 

345 350 
Glu His Lys Thr He Leu "'Thr Ser Gin 
360 365 
Asn Phe Met Gin Thr Arg Pro Glu Lys 
380 

Leu Met Ser Phe Ala Arg Ala Arg Leu 
395 400 
Glu Val Thr Glu Ser Ser Trp Asn He 

410 415 
Thr Val Val Ser Leu Tyr Val Leu His 

425 430 
Ala Val Ala Val Lys Thr Ala Lys Asp 
440 445 
Phe Trp Glu Ser Leu Lys His Val Leu 
460 

Asn Leu Lys Gly Thr Asp Val Val Phe 
475 480 
Tyr Arg Val His Ser Leu Gly Asp He 

490 495 
Pro Glu Gin Val Gly Phe Leu Pro Ser 

505 510 
Phe His Asp Arg Glu Glu Leu Glu Val 
520 525 
Asn Glu Arg Pro Val Pro Ser Thr Pro 
540 

Phe Asp Ala Asp Leu Trp His Ala Thr 
555 560 
Arg Ala Thr Leu Gin Ala Gly Leu Asn 

570 575 
lie Thr Leu Glu Asn Ala Leu Lys Thr 

585 590 
Pro Val Arg Gly Leu Glu Met Tyr Glu 
600 605 
Thr Gly Thr Leu He Ala Ala Leu Glu 
620 

Tyr Val Ala Pro Thr Arg Glu Leu Arg 
635 640 
Lys Pro Pro Ser Ala Ser Ala Thr Gin 
650 655 
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His Val Ala Leu Ala He Leu Arg Arg Ala Thr Ala Glu Gly Ala Pro 

660 665 670 

Phe Ala Thr Val Val He Asp Glu Cys Phe Met Phe Pro Leu Val Tyr 

675 680 685 

Val Ala He Val His Ala Leu Ser Pro Ser Ser Arg He Val Leu Val 

690 695 700 

Gly Asp Val His Gin lie Gly Phe He Asp Phe Gin Gly Thr Ser Ala 
705 710 715 720 

Asn Met Pro Leu Val Arg Asp Val Val Lys Gin Cys Arg Arg Arg Thr 

725 730 735 

Phe Asn Gin Thr Lys Arg Cys Pro Ala Asp Val Val Ala Thr Thr Phe 

740 745 750 

Phe Gin Ser Leu Tyr Pro Gly Cys Thr Thr Thr Ser Gly Cys Val Ala 

755 760 765 

Ser He Ser His Val Ala Pro Asp Tyr Arg Asn Ser Gin Ala Gin Thr 

770 775 780 

Leu Cys Phe Thr Gin Glu Glu Lys Ser Arg His Gly Ala Glu Gly Ala 
785 790 795 800 

Met Thr Val His Glu Ala Gin Gly Arg Thr Phe Ala Ser Val He Leu 

805 810 815 

His Tyr Asn Gly Ser Thr Ala Glu Gin Lys Leu Leu Ala Glu Lys Ser 

820 825 830 

His Leu Leu Val Gly He Thr Arg His Thr Asn His Leu Tyr lie Arg 

835 840 845 

Asp Pro Thr Gly Asp He Glu Arg Gin Leu Asn His Ser Ala Lys Ala 

850 855 860 

Glu Val Phe Thr Asp He Pro Ala Pro Leu Glu He Thr Thr Val Lys 
865 870 875 880 

Pro Ser Glu Glu Val Gin Arg Asn Glu Val Met Ala Thr lie Pro Pro 

885 890 895 

Gin Ser Ala Thr Pro His Gly Ala He His Leu Leu Arg Lys Asn Phe 

900 905 910 

Gly Asp Gin Pro Asp Cys Gly Cys Val Ala Leu Ala Lys Thr Gly Tyr 

915 920 925 

Glu Val Phe Gly Gly Arg Ala Lys He Asn Val Glu Leu Ala Glu Pro 

930 935 940 

Asp Ala Thr Pro Lys Pro His Arg Ala Phe Gin Glu Gly Val Gin Trp 
945 950 955 960 

Val Lys Val Thr Asn Ala Ser Asn Lys His Gin Ala Leu Gin Thr Leu 

965 970 975 

Leu Ser Arg Tyr Thr Lys Arg Ser Ala Asp Leu Pro Leu His Glu Ala 

980 985 990 

Lys Glu Asp Val Lys Arg Met Leu Asn Ser Leu Asp Arg His Trp Asp 

995 1000 1005 

Trp Thr Val Thr Glu Asp Ala Arg Asp Arg Ala Val Phe Glu Thr Gin 

1010 1015 1020 

Leu Lys Phe Thr Gin Arg Gly Gly Thr Val Glu Asp Leu Leu Glu Pro 
1025 1030 1035 1040 

Asp Asp Pro Tyr lie Arg Asp He Asp Phe Leu Met Lys Thr Gin Gin 

1045 1050 1055 

Lys Val Ser Pro Lys Pro lie Asn Thr Gly Lys Val Gly Gin Gly lie 

1060 1065 1070 

Ala Ala His Ser Lys Ser Leu Asn Phe Val Leu Ala Ala Trp lie Arg 

1075 1080 1085 

lie Leu Glu Glu lie Leu Arg Thr Gly Ser Arg Thr Val Arg Tyr Ser 

1090 1095 HOO 

Asn Gly Leu Pro Asp Glu Glu Glu Ala Met Leu Leu Glu Ala Lys lie 
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1105 1110 1115 1120 

Asn Gin Val Pro His Ala Thr Phe Val Ser Ala Asp Trp Thr Glu Phe 

1125 1130 1135 

Asp Thr Ala His Asn Asn Thr Ser Glu Leu Leu Phe Ala Ala Leu Leu 

1140 1145 1150 

Glu Arg He Gly Thr Pro Ala Ala Ala Val Asn Leu Phe Arg Glu Arg 

1155 ' 1160 1165 

Cys Gly Lys Arg Thr Leu Arg Ala Lys Gly Leu Gly Ser Val Glu Val 

1170 1175 1180 

Asp Gly Leu Leu Asp Ser Gly Ala Ala Trp Thr Pro Cys Arg Asn Thr 
1185 1190 1195 1200 

He Phe Ser Ala Ala Val Met Leu Thr Leu Phe Arg Gly Val Lys Phe 

1205 1210 1215 

Ala Ala Phe Lys Gly Asp Asp Ser Leu Leu Cys Gly Ser His Tyr Leu 

1220 1225 1230 

Arg Phe Asp Ala Ser Arg Leu His Met Gly Glu Arg Tyr Lys Thr Lys 

1235 1240 1245 

His Leu Lys Val Glu Val Gin Lys He Val Pro Tyr He Gly Leu Leu 

1250 1255 1260 

Val Ser Ala Glu Gin Val Val Leu Asp Pro Val Arg Ser 'Ala Leu Lys 
1265 1270 1275 1280 

He Phe Gly Arg Cys Tyr Thr Ser Glu Leu Leu Tyr Ser Lys Tyr Val 

1285 1290 1295 

Glu Ala Val Arg Asp He Thr Lys Gly Trp Ser Asp Ala Arg Tyr His 

1300 1305 1310 

Ser Leu Leu Cys His Met Ser Ala Cys Tyr Tyr Asn Tyr Ala Pro Glu 

1315 1320 1325 

Ser Ala Ala Tyr lie lie Asp Ala Val Val Arg Phe Gly Arg Gly Asp 

1330 1335 1340 

Phe Pro Phe Glu Gin Leu Arg Val Val Arg Ala His Val Gin Ala Pro 
1345 1350 1355 1360 

Asp Ala Tyr Ser Ser Thr Tyr Pro Ala Asn Val Arg Ala Ser Cys Leu 

1365 1370 1375 

Asp His Val Phe Glu Pro Arg Gin Ala Ala Ala Pro Ala Gly Phe Val 

1380 1385 1390 

Ala Thr Cys Ala Lys Pro Glu Thr Pro Ser Ser Leu Thr Ala Lys Ala 

1395 1400 1405 

Gly Val Ser Ala Thr Thr Ser His Val Ala Thr Gly Thr Ala Pro Pro 

1410 1415 1420 

Glu Ser Pro Trp Asp Ala Pro Ala Ala Asn .Ser Phe Ser Glu Leu Leu 
1425' 1430 1435 1440 

Thr Pro Glu Thr Pro Ser Thr Ser Ser Ser Pro Ser Ser Ser Ser Ser 

1445 1450 1455 

Asp Ser Ser Thr Ser Cys Gly Arg Ser Leu Ser Gly Gly Asp Thr Ala 

1460 1465 1470 

Arg Thr Thr Glu Asp Leu Asn Ser Arg Lys Pro Pro Ser Gin Asp Arg 

1475 1480 1485 

Gin Ser Arg Ser Ser Glu Cys Leu Asp Arg Ser Gly Glu Arg Thr Gly 

1490 1495 1500 

Ser Ser Leu Thr Ala Pro Thr Ala Pro Ser Pro Ser Phe Ser Phe Ser 
1505 1510 . 1515 1520 

Glu Arg Ala Arg Leu Ala Thr Gly Pro Thr Val Ala Ala Ala Thr Ser 

1525 1530 1535 

Pro Ser Ala Thr Pro Ser Cys Ala Thr Asp Gin Val Ala Ala Arg Thr 

1540 1545 1550 

Thr Pro Asp Phe Ala Pro Phe Leu Gly Ser Gin Ser Ala Arg Ala Val 
1555 1560 - 1565 
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Ser Lys Pro Tyr Arg Pro Pro Thr Thr Ala Arg Trp Lys Glu Val Thr 

1570 1575 1580 

Pro Leu His Ala Trp Lys Gly Val Thr Gly Asp Arg Pro Glu Val Arg 
1585 1590 1595 1600 

Glu Asp Pro Glu Thr Ala Ala Val Val Gin Ala Leu He Ser Gly Arg 

1605 1610 1615 

Tyr Pro Gin Lys Thr Lys Leu Ser Ser Asp Ala Ser Lys Gly Tyr Ser 

1620 1625 1630 

Arg Thr Lys Gly Cys Ser Gin Ser Thr Ser Phe Pro Ala Pro Ser Ala 

1635 1640 1645 

Asp Tyr Gin Ala Arg Asp Cys Gin Thr Val Arg Val Cys Arg Ala Ala 

1650 1655 1660 

Ala Glu Met Ala Arg Ser Cys He His Glu Pro Leu Ala Ser Ser Ala 
1665 1670 1675 1680 

Ala Ser Ala Asp Leu Lys Arg He Arg Ser Thr Ser Asp Ser Val Pro 

1685 1690 1695 

Asp Val Lys lie Ser Lys Ser Ala 
1700 

(2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 4218.. 4512 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

GTTCTGCCTC CCCCGGACGG TAAATATAGG -GGAACAATGT ACGCGAAAGC GACAGACGTG 60 

GCGCGTGTCT ACGCCGCGGC AGATGTCGCC TACGCGAACG TACTGCAGCA GAGAGCAGTC 120 

AAGTTGGACT TCGCCCCGCC ACTGAAGGCA CTAGAAACCC TCCACAGACT GTACTATCCG 180 

CTGCGCTTCA AAGGGGGCAC TTTACCCCCG ACACAACACC CGATCCTGGC CGGGCACCAA 240 

CGTGTCGCAG AAGAGGTTCT GCACAATTTC GCCAGGGGAC GTAGCACAGT GCTCGAGATA 300 

GGGCCGTCTC TGCACAGCGC ACTTAAGCTA CATGGGGCAC CGAACGCCCC CGTCGCAGAC 360 

TATCACGGGT GCACCAAGTA CGGCACCCGC GACGGCTCGC GACACATTAC GGCCTTAGAG 420 

TCTAGATCCG TCGCCACAGG CCGGCCCGAG TTCAAGGCCG ACGCCTCACT GCTCGCCAAC 4 80 

GGCATTGCCT CCCGCACCTT CTGCGTCGAC GGAGTCGGCT CTTGCGCGTT CAAATCGCGC 540 

GTTGGAATTG CCAATCACTC CCTCTATGAC GTGACCCTAG AGGAGCTGGC CAATGCGTTT 600 

GAGAACCACG GACTTCACAT GGTCCGCGCG TTCATGCACA TGCCAGAAGA GCTGCTCTAC 660 

ATGGACAACG TGGTTAATGC CGAGCTCGGC TACCGCTTCC ACGTTATTGA AGAGCCTATG 720 

GCTGTGAAGG ACTGCGCATT CCAGGGGGGG GACCTCCGTC TCCACTTCCC TGAGTTGGAC 780 

TTCATCAACG AGAGCCAAGA GCGGCGCATC GAGAGGCTGG CCGCCCGCGG CTCCTACTCC 840 

AGACGCGCCG TCATTTTCTC CGGCGACGAC GACTGGGGTG ATGCGTACTT ACACGACTTC 900 

CACACATGGC TCGCCTACCT ACTGGTGAGG AACTACCCCA CTCCGTTTGG TTTCTCACTC 960 

CATATAGAAG TCCAGAGGCG CCACGGCTCC AGCATTGAGC TGCGCATCAC TCGCGCGCCA 1020 

CCTGGAGACC GCATGCTGGC CGTCGTCCCA AGGACGTCCC AAGGCCTCTG CAGAATCCCA 1080 

AACATCTTTT ATTACGCCGA CGCGTCGGGC ACTGAGCATA AGACCATCCT TACGTCACAG 114 0 

CACAAAGTCA ACATGCTGCT CAATTTTATG CAAACGCGTC CTGAGAAGGA ACTAGTCGAC 1200 

ATGACCGTCT TGATGTCGTT CGCGCGCGCT AGGCTGCGCG CGATCGTGGT CGCCTCAGAA 1260 

GTCACCGAGA GCTCCTGGAA CATCTCACCG GCTGACCTGG TCCGCACTGT CGTGTCTCTT 1320 

TACGTCCTCC ACATCATCGA GCGCCGAAGG GCTGCGGTCG CTGTCAAGAC CGCCAAGGAC 1380 

GACGTCTTTG GAGAGACTTC GTTCTGGGAG AGTCTCAAGC ACGTCTTGGG CTCCTGTTGC 14 4 0 

GGTCTGCGCA ACCTCAAAGG CACCGACGTC GTCTTTACTA AGCGCGTCGT CGATAAGTAC 1500 
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CGAGTCCACT CGCTCGGAGA CATAATCTGC GACGTCCGCC TGTCCCCTGA ACAGGTCGGC 1560 

TTCCTGCCGT CCCGCGTACC ACCTGCCCGC GTCTTTCACG ACAGGGAAGA GCTTGAGGTC 1620 

CTTCGCGAAG CTGGCTGCTA CAACGAACGT CCGGTACCTT CCACTCCTCC TGTGGAGGAG 1680 

CCCCAAGGTT TCGACGCCGA CTTGTGGCAC GCGACCGCGG CCTCACTCCC CGAGTACCGC 174 0 

GCCACCTTGC AGGCAGGTCT CAACACCGAC GTCAAGCAGC TCAAGATCAC CCTCGAGAAC 1800 

GCCCTCAAGA CCATCGACGG GCTCACCCTC TCCCCAGTCA GAGGCCTCGA GAT GT AC GAG 18 60 

GGCCCGCCAG GCAGCGGCAA GACGGGCACC CTCATCGCCG CCCTTGAGGC CGCGGGCGGT 1920 

AAAGCACTTT ACGTGGCACC CACCAGAGAA CTGAGAGAGG CTATGGACCG GCGGATCAAA 1980 

CCGCCGTCCG CCTCGGCTAC GCAACATGTC GCCCTTGCGA TTCTCCGTCG TGCCACCGCC 204 0 

GAGGGCGCCC CTTTCGCTAC CGTGGTTATC GACGAGTGCT TCATGTTCCC GCTCGTGTAC 2100 

GTCGCGATCG TGCACGCCTT GTCCCCGAGC TCACGAATAG TCCTTGTAGG GGACGTCCAC 2160 

CAAATCGGGT TTATAGACTT CCAAGGCACA AGCGCGAACA TGCCGCTCGT TCGCGACGTC 2220 

GTTAAGCAGT GCCGTCGGCG CACTTTCAAC CAAACCAAGC GCTGTCCGGC CGACGTCGTT 2280 

GCCACCACGT TTTTCCAGAG CTTGTACCCC GGGTGCACAA CCACCTCAGG GTGCGTCGCA 2340 

TCCATCAGCC ACGTCGCCCC AGACTACCGC AACAGCCAGG CGCAAACGCT CTGCTTCACG 2400 

CAGGAGGAAA AGTCGCGCCA CGGGGCTGAG GGCGCGATGA CTGTGCACGA AGCGCAGGGA 24 60 

CGCACTTTTG CGTCTGTCAT TCTGCATTAC AACGGCTCCA CAGCAGAGCA GAAGCTCCTC 2520 

GCTGAGAAGT CGCACCTTCT AGTCGGCATC ACGCGCCACA CCAACCACCT GTACATCCGC 2580 

GACCCGACAG GTGACATTGA GAGACAACTC AACCATAGCG CGAAAGCCGA GGTGTTTACA 2640 

GACATCCCTG CACCCCTGGA GATCACGACT GTCAAACCGA GTGAAGAGGT GCAGCGCAAC 2700 

GAAGTGATGG CAACGATACC CCCGCAGAGT GCCACGCCGC ACGGAGCAAT CCATCTGCTC 27 60 

CGCAAGAACT TCGGGGACCA ACCCGACTGT GGCTGTGTCG CTTTGGCGAA GACCGGCTAC 2820 

GAGGTGTTTG GCGGTCGTGC CAAAATCAAC GTAGAGCTTG CCGAACCCGA CGCGACCCCG 2880 

AAGCCGCATA GGGCGTTCCA GGAAGGGGTA CAGTGGGTCA AGGTCACCAA CGCGTCTAAC 2940 

AAACACCAGG CGCTCCAGAC GCTGTTGTCC CGCTACACCA AGCGAAGCGC TGACCTGCCG 3000 

CTACACGAAG CTAAGGAGGA CGTCAAACGC ATGCTAAACT CGCTTGACCG ACATTGGGAC 3060 

TGGACTGTCA CTGAAGACGC CCGTGACCGA GCTGTCTTCG AGACCCAGCT CAAGTTCACC 3120 

CAACGCGGCG GCACCGTCGA AGACCTGCTG GAGCCAGACG ACCCCTACAT CCGTGACATA 3180 

GACTTCCTTA TGAAGACTCA GCAGAAAGTG TCGCCCAAGC CGATCAATAC GGGCAAGGTC 3240 

GGGCAGGGGA TCGCCGCTCA CTCAAAGTCT CTCAACTTCG TCCTCGCCGC TTGGATACGC 3300 

ATACTCGAGG AGATACTCCG TACCGGGAGC CGCACGGTCC GGTACAGCAA CGGTCTCCCC 3360 

GACGAAGAAG AGGCCATGCT GCTCGAAGCG AAGATCAATC AAGTCCCACA CGCCACGTTC 3420 

GTCTCGGCGG ACTGGACCGA GTTTGACACC GCCCACAATA ACACGAGTGA GCTGCTCTTC 34 80 

GCCGCCCTTT TAGAGCGCAT CGGCACGCCT GCAGCTGCCG TTAATCTATT CAGAGAACGG 3540 

TGTGGGAAAC GCACCTTGCG AGCGAAGGGT CTAGGCTCCG TTGAAGTCGA CGGTCTGCTC 3600 

GACTCCGGCG CAGCTTGGAC GCCTTGCCGC AACACCATCT TCTCTGCCGC CGTCATGCTC 3660 

ACGCTCTTCC GCGGCGTCAA GTTCGCAGCT TTCAAAGGCG ACGACTCGCT CCTCTGTGGT 3720 

AGCCATTACC TCCGTTTCGA CGCTAGCCGC CTTCACATGG GCGAACGTTA CAAGACCAAA 3780. 
CATTTGAAGG TCGAGGTGCA GAAAATCGTG CCGTACATCG GACTCCTCGT CTCCGCTGAG ■ 3840 

CAGGTCGTCC TCGACCCTGT CAGGAGCGCT CTCAAGATAT TTGGGCGCTG CTACACAAGC 3900 

GAACTCCTTT ACTCCAAGTA CGTGGAGGCT GTGAGAGACA TCACCAAGGG CTGGAGTGAC 3960 

GCCCGCTACC ACAGCCTCCT GTGCCACATG TCAGCATGCT ACTACAATTA CGCGCCGGAG 4 020 

TCTGCGGCGT AC AT CAT CG A CGCTGTTGTT CGCTTTGGGC GCGGCGACTT CCCGTTTGAA 4 080 

CAACTGCGCG TGGTGCGTGC CCATGTGCAG GCACCCGACG CTTACAGCAG CACGTATCCG 4140 

GCTAACGTGC GCGCATCGTG CCTTGACCAC GTCTTCGAGC CCCGCCAGGC CGCCGCCCCG 4200 

GCAGGTTTC'G TTGCGAC ATG TGC GAA GCC GGA AAC GCC TTC TTC ACT TAC 4 250 

Met Cys Glu Ala Gly Asn Ala Phe Phe Thr Tyr 
15 10 



CGC 


GAA 


AGC 


TGG TGT 


TTC 


TGC 


GAC 


TAC 


AAG 


CCA 


CGT 


TGC 


GAC 


TGG 


GAC 


4298 


Arg 


Glu 


Ser 


Trp Cys 


Phe 


Cys 


Asp 


Tyr 


Lys 


Pro 


Arg Cys 


Asp 


Trp Asp 










15 








20 










25 








TGC 


GCC 


CCC 


GGA GTC 


TCC 


ATG 


GGA 


TGC 


ACC 


TGC 


AGC 


CAA 


CAG 


CTT 


TTC 


4346 


Cys 


Ala 


Pro 


Gly Val 


Ser 


Met 


Gly Cys 


Thr 


Cys 


Ser 


Gin 


Gin 


Leu 


Phe 








30 








35 








40 










GGA 


GTT 


ATT 


GAC ACC 


GGA 


GAC 


CCC 


GTC 


CAC 


ATC 


ATC 


CTC 


GCC 


GTC 


ATC 


4394 


Gly 


Val 


He 


Asp Thr 


Gly Asp 


Pro 


Val 


His 


He 


He 


Leu 


Ala 


Val 


He 





45 50 55 



20 



GTC TTC ATC GGA CTC CTC TAC ATC GTG TGG AAG GTC GCT CAG TGG TGG 4 4 42 
Val Phe He Gly Leu Leu Tyr He Val Trp Lys Val Ala Gin Trp Trp 
60 65 70 75 

AG A CAC CGC AAG GAC CAC AGA AG A CTT GAA CAG CAG AAA GCC GCC TTC 4 4 90 
Arg His Arg Lys Asp His Arg Arg Leu Glu Gin Gin Lys Ala Ala Phe 

80 85 90 

GCA AGA CAG GCA ATC ACG CTC GTC TGAATGTC TGGACAGAAG CGGAGAAAGG 4542 
Ala Arg Gin Ala He Thr Leu Val 
95 

ACAGGCAGTT CGTTAACTGC CCCCACTGCT CCGAGCCCCT CATTCTCATT TTCGGAAAGA 4 602 

GCTCGACTGG CGACCGGGCC GACTGTCGCC GCTGCGACAT CACCTTCGGC AACCCCATCC 4 662 

TGCGCCACGG ACCAGGTTGC CGCGAGGACC ACGCCGGACT TTGCGCCTTT CCTGGGTTCC 4722 

CAGTCTGCCC GTGCTGTCTC GAAGCCGTAC CGGCCCCCCA CGACTGCCCG TTGGAAAGAA 4 782 

GTCACCCCGC TCCACGCGTG GAAGGGCGTG ACCGGAGACC GACCGGAAGT CAGGGAGGAC 4 842 

CCGGAGACAG CGGCGGTCGT CCAGGCTCTG ATCAGCGGCC GTTATCCTCA G AAG ACG AAG 4 902 

CTTTCCTCCG ACGCATCCAA AGGCTACTCA AGAACTAAGG GATGCTCACA ATCCACCTCT 4 962 

TTTCCTGCCC CGAGTGCGGA TTACCAGGCC CGCGACTGCC AGACAGTCCG AGTCTGCCGC 5022 

GCCGCTGCAG AGATGGCGCG CTCATGTATT CACGAGCCGT TGGCTTCATC TGCCGCCAGT 5082 

GCCGACTTGA AG CGC AT ACG CTCTACCTCG GACTCTGTTC CCGATGTAAA GAT CAG C AAG 514 2 

AGCGCATGAA GGAACAAAAT TAGTTTCCTT GTTCGTAAAC AAGGTGGTCC CTCCCATTGA 5202 

GGTAAAGACT CTGGTGAGTC CTCAACGTTA CTCGTTGAGT CTGCTGCGGT TCGATTCCAT 5262 

TCCCAAGCAG CAAAGGGTGC GCAACTAGTA CGGCGCCCCC TGGGATACCA 5312 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Cys Glu Ala Gly Asn Ala Phe Phe Thr Tyr Arg Glu Ser Trp Cys 

15 10 15 

Phe Cys Asp Tyr Lys Pro Arg Cys Asp Trp Asp Cys Ala Pro Gly Val 

20 25 30 

Ser Met Gly Cys Thr Cys Ser Gin Gin Leu Phe Gly Val He Asp Thr 

35 40 45 

Gly Asp Pro Val His He He Leu Ala Val He Val Phe He Gly Leu 

50 55 60 

Leu Tyr He Val Trp Lys Val Ala Gin Trp Trp Arg His Arg Lys Asp 
65 70 75 80 

His Arg Arg Leu Glu Gin Gin Lys Ala Ala Phe Ala Arg Gin Ala He 
85 90 95 

Thr Leu Val 

(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 4518.. 4937 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
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GTTCTGCCTC CCCCGGACGG TAAATATAGG GGAACAATGT ACGCGAAAGC GACAGACGTG 60 

GCGCGTGTCT ACGCCGCGGC AGATGTCGCC TACGCGAACG TACTGCAGCA GAGAGCAGTC 120 

AAGTTGGACT TCGCCCCGCC ACTGAAGGCA CTAGAAACCC TCCACAGACT GTACTATCCG 180 

CTGCGCTTCA AAGGGGGCAC TTTACCCCCG ACACAACACC CGATCCTGGC CGGGCACCAA 240 

CGTGTCGCAG AAGAGGTTCT GCACAATTTC GCCAGGGGAC GTAGCACAGT GCTCGAGATA 300 

GGGCCGTCTC TGCACAGCGC ACTTAAGCTA CATGGGGCAC CGAACGCCCC CGTCGCAGAC 360 

TATCACGGGT GCACCAAGTA CGGCACCCGC GACGGCTCGC GACACATTAC GGCCTTAGAG 420 

TCTAGATCCG TCGCCACAGG CCGGCCCGAG TTCAAGGCCG ACGCCTCACT GCTCGCCAAC 4 80 

GGCATTGCCT CCCGCACCTT CTGCGTCGAC GGAGTCGGCT CTTGCGCGTT CAAATCGCGC 540 

GTTGGAATTG CCAATCACTC CCTCTATGAC GTGACCCTAG AGGAGCTGGC CAATGCGTTT 600 

GAGAACCACG GACTTCACAT GGTCCGCGCG TTCATGCACA TGCCAGAAGA GCTGCTCTAC 660 

ATGGACAACG TGGTTAATGC CGAGCTCGGC TACCGCTTCC ACGTTATTGA AGAGCCTATG 720 

GCTGTGAAGG ACTGCGCATT CCAGGGGGGG GACCTCCGTC TCCACTTCCC TGAGTTGGAC 780 

TTCATCAACG AGAGCCAAGA GCGGCGCATC GAGAGGCTGG CCGCCCGCGG CTCCTACTCC 84 0 

AGACGCGCCG TCATTTTCTC CGGCGACGAC GACTGGGGTG ATGCGTACTT ACACGACTTC 900 

CACACATGGC TCGCCTACCT ACTGGTGAGG AACTACCCCA CTCCGTTTGG TTTCTCACTC 960 

CATATAGAAG TCCAGAGGCG CCACGGCTCC AGCATTGAGC TGCGCATCAC TCGCGCGCCA 1020 

CCTGGAGACC GCATGCTGGC CGTCGTCCCA AGGACGTCCC AAGGCCTCTG CAGAATCCCA 1080 

AACATCTTTT ATTACGCCGA CGCGTCGGGC ACTGAGCATA AGACCATCCT TACGTCACAG 114 0 

CACAAAGTCA ACATGCTGCT CAATTTTATG CAAACGCGTC CTGAGAAGGA ACTAGTCGAC 1200 

ATGACCGTCT TGATGTCGTT CGCGCGCGCT AGGCTGCGCG CGATCGTGGT CGCCTCAGAA 1260 

GTCACCGAGA GCTCCTGGAA CATCTCACCG GCTGACCTGG TCCGCACTGT CGTGTCTCTT 1320 

TACGTCCTCC ACATCATCGA GCGCCGAAGG GCTGCGGTCG CTGTCAAGAC CGCCAAGGAC 1380 

GACGTCTTTG GAGAGACTTC GTTCTGGGAG AGTCTCAAGC ACGTCTTGGG CTCCTGTTGC 14 4 0 

GGTCTGCGCA ACCTCAAAGG CACCGACGTC GTCTTTACTA AGCGCGTCGT CGATAAGTAC 1500 

CGAGTCCACT CGCTCGGAGA CATAATCTGC GACGTCCGCC TGTCCCCTGA ACAGGTCGGC 1560 

TTCCTGCCGT CCCGCGTACC ACCTGCCCGC GTCTTTCACG ACAGGGAAGA GCTTGAGGTC 1620 

CTTCGCGAAG CTGGCTGCTA CAACGAACGT CCGGTACCTT CCACTCCTCC TGTGGAGGAG 1680 

CCCCAAGGTT TCGACGCCGA CTTGTGGCAC GCGACCGCGG CCTCACTCCC CGAGTACCGC 1740 

GCCACCTTGC AGGCAGGTCT CAACACCGAC GTCAAGCAGC TCAAGATCAC CCTCGAGAAC 1800 

GCCCTCAAGA CCATCGACGG GCTCACCCTC TCCCCAGTCA GAGGCCTCGA GATGTACGAG 18 60 

GGCCCGCCAG GCAGCGGCAA GACGGGCACC CTCATCGCCG CCCTTGAGGC CGCGGGCGGT 1920 

AAAGCACTTT ACGTGGCACC CACCAGAGAA CTGAGAGAGG CTATGGACCG GCGGATCAAA 1980 

CCGCCGTCCG CCTCGGCTAC GCAACATGTC GCCCTTGCGA TTCTCCGTCG TGCCACCGCC 204 0 

GAGGGCGCCC CTTTCGCTAC CGTGGTTATC GACGAGTGCT TCATGTTCCC GCTCGTGTAC 2100 

GTCGCGATCG TGCACGCCTT GTCCCCGAGC TCACGAATAG TCCTTGTAGG GGACGTCCAC 2160 

CAAATCGGGT TTATAGACTT CCAAGGCACA AGCGCGAACA TGCCGCTCGT TCGCGACGTC 2220 

GTTAAGCAGT GCCGTCGGCG CACTTTCAAC CAAACCAAGC GCTGTCCGGC CGACGTCGTT 2280 

GCCACCACGT TTTTCCAGAG CTTGTACCCC GGGTGCACAA CCACCTCAGG GTGCGTCGGA 234 0 

TCCATCAGCC ACGTCGCCCC AGACTACCGC AACAGCCAGG CGCAAACGCT CTGCTTCACG 2400 

CAGGAGGAAA AGTCGCGCCA CGGGGCTGAG GGCGCGATGA CTGTGCACGA AGCGCAGGGA 24 60 

CGCACTTTTG CGTCTGTCAT TCTGCATTAC AACGGCTCCA CAGCAGAGCA GAAGCTCCTC 2520 

GCTGAGAAGT CGCACCTTCT AGTCGGCATC ACGCGCCACA CCAACCACCT GTACATCCGC 2580 

GACCCGACAG GTGACATTGA GAGACAACTC AACCATAGCG CGAAAGCCGA GGTGTTTACA 264 0 

GACATCCCTG CACCCCTGGA GATCACGACT GTCAAACCGA GTGAAGAGGT GCAGCGCAAC 2700 

GAAGTGATGG CAACGATACC CCCGCAGAGT GCCACGCCGC ACGGAGCAAT CCATCTGCTC 27 60 

CGCAAGAACT TCGGGGACCA ACCCGACTGT GGCTGTGTCG CTTTGGCGAA GACCGGCTAC 2820 

GAGGTGTTTG GCGGTCGTGC CAAAATCAAC GTAGAGCTTG CCGAACCCGA CGCGACCCCG 2880 

AAGCCGCATA GGGCGTTCCA GGAAGGGGTA CAGTGGGTCA AGGTCACCAA CGCGTCTAAC 294 0 

AAACACCAGG CGCTCCAGAC GCTGTTGTCC CGCTACACCA AGCGAAGCGC TGACCTGCCG 3000 

CTACACGAAG CTAAGGAGGA CGTCAAACGC ATGCTAAACT CGCTTGACCG ACATTGGGAC 3060 

TGGACTGTCA CTGAAGACGC CCGTGACCGA GCTGTCTTCG AGACCCAGCT CAAGTTCACC 3120 

CAACGCGGCG GCACCGTCGA AGACCTGCTG GAGCCAGACG ACCCCTACAT CCGTGACATA 3180 

GACTTCCTTA TGAAGACTCA GCAGAAAGTG TCGCCCAAGC CGATCAATAC GGGCAAGGTC 324 0 

GGGCAGGGGA TCGCCGCTCA CTCAAAGTCT CTCAACTTCG TCCTCGCCGC TTGGATACGC 3300 

ATACTCGAGG AGATACTCCG TACCGGGAGC CGCACGGTCC GGTACAGCAA CGGTCTCCCC 3360 
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GAC G AAG AAG AGGCCATGCT GCTCGAAGCG AAGATCAATC AAGTCCCACA CGCCACGTTC 3420 

GTCTCGGCGG ACTGGACCGA GTTTGACACC GCCCACAATA ACACGAGTGA GCTGCTCTTC 3480 

GCCGCCCTTT TAGAGCGCAT CGGCACGCCT GCAGCTGCCG TTAATCTATT CAGAGAACGG 3540 

TGTGGGAAAC GCACCTTGCG AGCGAAGGGT CTAGGCTCCG TTGAAGTCGA CGGTCTGCTC 3600 

GACTCCGGCG CAGCTTGGAC GCCTTGCCGC AAC AC CAT CT TCTCTGCCGC CGTCATGCTC 3660 

ACGCTCTTCC GCGGCGTCAA GTTCGCAGCT TTCAAAGGCG AC GAC T CGC T CCTCTGTGGT 3720 

AGCCATTACC TCCGTTTCGA CGCTAGCCGC CTTCACATGG GCGAACGTTA CAAGACCAAA 3780 

CATTTGAAGG TCGAGGTGCA GAAAATCGTG CCGTACATCG GACTCCTCGT CTCCGCTGAG 3840 

CAGGTCGTCC TCGACCCTGT CAGGAGCGCT CTCAAGATAT TTGGGCGCTG CTACACAAGC 3900 

GAACTCCTTT ACTCCAAGTA CGTGGAGGCT GTGAGAGACA TCACCAAGGG CTGGAGTGAC 3960 

GCCCGCTACC ACAGCCTCCT GTGCCACATG TCAGCATGCT ACTACAATTA CGCGCCGGAG 4 020 

TCTGCGGCGT ACATCATCGA CGCTGTTGTT CGCTTTGGGC GCGGCGACTT CCCGTTTGAA 4 080 

CAACTGCGCG TGGTGCGTGC CCATGTGCAG GCACCCGACG CTTACAGCAG CACGTATCCG 4140 

GCTAACGTGC GCGCATCGTG CCTTGACCAC GTCTTCGAGC CCCGCCAGGC CGCCGCCCCG 4200 

GCAGGTTTCG TTGCGACATG TGCGAAGCCG GAAACGCCTT CTTCACTTAC CGCGAAAGCT 4260 

GGTGTTTCTG CG ACT AC AAG CCACGTTGCG ACTGGGACTG CGCCCCCGGA GTCTCCATGG 4 320 

GATGCACCTG CAGCCAACAG CTTTTCGGAG T TAT T GAC AC CGGAGACCCC GTCCACATCA 4 380 

TCCTCGCCGT CATCGTCTTC ATCGGACTCC TCTACATCGT GTGGAAGGTC GCTCAGTGGT 4 440 

GGAGACACCG CAAGGACCAC AG AAG ACT TG AACAGCAGAA AGCCGCCTTC GCAAGACAGG 4 500 

CAATCACGCT CGTCTGA ATG TCT GGA CAG AAG CGG AGA AAG GAC AGG CAG 4 550 

Met Ser Gly Gin Lys Arg Arg Lys Asp Arg Gin 
1 5 10 

TTC GTT AAC TGC CCC CAC TGC TCC GAG CCC CTC ATT CTC ATT TTC GGA 4 598 
Phe Val Asn Cys Pro His Cys Ser Glu Pro Leu He Leu He Phe Gly 

15 20 25 

AAG AGC TCG ACT GGC GAC CGG" GCC GAC TGT CGC CGC TGC GAC ATC ACC 4 64 6 
Lys Ser Ser Thr Gly Asp Arg Ala Asp Cys Arg Arg Cys Asp He Thr 

30 35 '40 

TTC GGC AAC CCC ATC CTG CGC CAC GGA CCA GGT TGC CGC GAG GAC CAC 4 694 
Phe Gly Asn Pro He Leu Arg His Gly Pro Gly Cys Arg Glu Asp His 

45 50 55 

GCC GGA CTT TGC GCC TTT CCT GGG TTC CCA GTC TGC CCG TGC TGT CTC 4 742 
Ala Gly Leu Cys Ala Phe Pro Gly Phe Pro Val Cys Pro Cys Cys Leu 
60 65 70 75 

GAA GCC GTA CCG GCC CCC CAC GAC TGC CCG TTG GAA AGA AGT CAC CCC 4 790 
Glu Ala Val Pro Ala Pro His Asp Cys Pro Leu Glu Arg Ser His Pro 

80 85 90 

GCT CCA CGC GTG GAA GGG CGT GAC CGG AGA CCG ACC GGA AGT CAG GGA 4 838 
Ala Pro Arg Val Glu Gly Arg Asp Arg Arg Pro Thr Gly Ser Gin Gly 

95 100 105 

GGA CCC GGA GAC AGC GGC GGT CGT CCA GGC TCT GAT CAG CGG CCG TTA 4 886 
Gly Pro Gly Asp Ser Gly Gly Arg Pro Gly Ser Asp Gin Arg Pro Leu 

HO 115 120 

TCC TCA GAA GAC GAA GCT TTC CTC CGA CGC ATC CAA AGG CTA CTC AAG 4 934 
Ser Ser Glu Asp Glu Ala Phe Leu Arg Arg He Gin Arg Leu Leu Lys 

125 130 135 

AAC TAAGGGATGC TCACAATCCA CCTCTTTTCC TGCCCCGAGT GCGGATTACC 4 987 
Asn 
140 

AGGCCCGCGA CTGCCAGACA GTCCGAGTCT GCCGCGCCGC TGCAGAGATG GCGCGCTCAT 5047 

GTATTCACGA GCCGTTGGCT TCATCTGCCG CCAGTGCCGA CTTGAAGCGC ATACGCTCTA 5107 

CCTCGGACTC TGTTCCCGAT GTAAAGATCA GCAAGAGCGC ATGAAGGAAC AAAATTAGTT 5167 

TCCTTGTTCG TAAACAAGGT GGTCCCTCCC ATTGAGGTAA AGACTCTGGT GAGTCCTCAA 5227 

CGTTACTCGT TGAGTCTGCT GCGGTTCGAT TCCATTCCCA AGCAGCAAAG GGTGCGCAAC 5287 

TAGTACGGCG CCCCCTGGGA TACCA 5312 

(2) INFORMATION FOR SEQ ID NO: 44: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



Met 


Ser 


Gly 


Gin 


Lys 


Arg 


Arg 


Lys 


Asp Arg 


Gin 


Phe 


Val 


Asn 


Cys 


Pro 


1 








5 










10 










15 




His 


Cys 


Ser 


Glu 
20 


Pro 


Leu 


He 


Leu 


He 
25 


Phe 


Gly 


Lys 


Ser 


Ser 
30 


Thr 


Gly 


Asp 


Arg 


Ala 
35 


Asp 


Cys 


Arg 


Arg 


Cys 
40 


Asp 


He 


Thr 


Phe 


Gly 
45 


Asn 


Pro 


He 


Leu 


Arg 


His 


Gly 


Pro 


Gly 


Cys 


Arg 


Glu Asp 


His 


Ala 


Gly 


Leu 


Cys 


Ala 




50 










55 










60 










Phe 


Pro 


Gly 


Phe 


Pro 


Val 


Cys 


Pro 


Cys 


Cys 


Leu 


Glu 


Ala 


Val 


Pro 


Ala 


65 










70 










75 










80 


Pro 


His 


Asp 


Cys 


Pro 
85 


Leu 


Glu 


Arg 


Ser 


His 
90 


Pro 


Ala 


Pro 


Arg 


Val 
95 


Glu 


Gly 


Arg 


Asp 


Arg 


Arg 


Pro 


Thr 


Gly 


Ser 


Gin 


Gly Gly 


Pro 


Gly 


Asp 


Ser 








100 










105 










110 






Gly 


Gly 


Arg 
115 


Pro 


Gly 


Ser 


Asp 


Gin 
120 


Arg 


Pro 


Leu 


Ser 


Ser 
125 


Glu 


Asp 


Glu 


Ala 


Phe 
130 


Leu 


Arg 


Arg 


He 


Gin 
135 


Arg 


Leu 


Leu 


Lys 


Asn 
140 











(2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 4944.. 5162 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

GTTCTGCCTC CCCCGGACGG TAAATATAGG GGAACAATGT ACGCGAAAGC GACAGACGTG 
GCGCGTGTCT ACGCCGCGGC AGATGTCGCC TACGCGAACG TACTGCAGCA GAGAGCAGTC 
AAGTTGGACT TCGCCCCGCC ACTGAAGGCA CTAGAAACCC TCCACAGACT GTACTATCCG 
CTGCGCTTCA AAGGGGGCAC TTTACCCCCG ACACAACACC CGATCCTGGC CGGGCACCAA 
CGTGTCGCAG AAGAGGTTCT GCACAATTTC GCCAGGGGAC GTAGCACAGT GCTCGAGATA 
GGGCCGTCTC TGCACAGCGC ACTTAAGCTA CATGGGGCAC CGAACGCCCC CGTCGCAGAC 
TATCACGGGT GCACCAAGTA CGGCACCCGC GACGGCTCGC G AC AC ATT AC GGCCTTAGAG 
TCTAGATCCG TCGCCACAGG CCGGCCCGAG TTCAAGGCCG ACGCCTCACT GCTCGCCAAC 
GGCATTGCCT CCCGCACCTT CTGCGTCGAC GGAGTCGGCT CTTGCGCGTT CAAATCGCGC 
GTTGGAATTG CCAATCACTC CCTCTATGAC GTGACCCTAG AGGAGCTGGC CAATGCGTTT 
GAGAACCACG GACTTCACAT GGTCCGCGCG TTCATGCACA TGCCAGAAGA GCTGCTCTAC 
ATGGACAACG TGGTTAATGC CGAGCTCGGC TACCGCTTCC ACGTTATTGA AGAGCCTATG 
GCTGTGAAGG ACTGCGCATT CCAGGGGGGG GACCTCCGTC TCCACTTCCC TGAGTTGGAC 
TTCATCAACG AGAGCCAAGA GCGGCGCATC GAGAGGCTGG CCGCCCGCGG CTCCTACTCC 
AGACGCGCCG TCATTTTCTC CGGCGACGAC GACTGGGGTG ATGCGTACTT ACACGACTTC 
CACACATGGC TCGCCTACCT ACTGGTGAGG AACTACCCCA CTCCGTTTGG TTTCTCACTC 
CATATAGAAG TCCAGAGGCG CCACGGCTCC AGCATTGAGC TGCGCATCAC TCGCGCGCCA 
CCTGGAGACC GCATGCTGGC CGTCGTCCCA AGGACGTCCC AAGGCCTCTG CAGAATCCCA 
AACATCTTTT ATTACGCCGA CGCGTCGGGC ACTGAGCATA AGACCATCCT TACGTCACAG 
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CACAAAGTCA ACATGCTGCT CAATTTTATG CAAACGCGTC CTGAGAAGGA ACTAGTCGAC 1200 

ATGACCGTCT TGATGTCGTT CGCGCGCGCT AGGCTGCGCG CGATCGTGGT CGCCTCAGAA 1260 

GTCACCGAGA GCTCCTGGAA CATCTCACCG GCTGACCTGG TCCGCACTGT CGTGTCTCTT 1320 

TACGTCCTCC AC AT CATC GA GCGCCGAAGG GCTGCGGTCG CTGTCAAGAC CGCCAAGGAC 1380 

GACGTCTTTG GAGAGACTTC GTTCTGGGAG AGTCTCAAGC ACGTCTTGGG CTCCTGTTGC 1440 

GGTCTGCGCA ACCTCAAAGG CACCGACGTC GTCTTTACTA AGCGCGTCGT CGATAAGTAC 1500 

CGAGTCCACT CGCTCGGAGA CATAATCTGC GACGTCCGCC TGTCCCCTGA ACAGGTCGGC 1560 

TTCCTGCCGT CCCGCGTACC ACCTGCCCGC GTCTTTCACG ACAGGGAAGA GCTTGAGGTC 1620 

CTTCGCGAAG CTGGCTGCTA CAACGAACGT CCGGTACCTT CCACTCCTCC TGTGGAGGAG 1680 

CCCCAAGGTT TCGACGCCGA CTTGTGGCAC GCGACCGCGG CCTCACTCCC CGAGTACCGC 1740 

GCCACCTTGC AGGCAGGTCT CAACACCGAC GTCAAGCAGC TCAAGATCAC CCTCGAGAAC 1800 

GCCCTCAAGA CCATCGACGG GCTCACCCTC TCCCCAGTCA GAGGCCTCGA GATGTACGAG 1860 

GGCCCGCCAG GCAGCGGCAA GACGGGCACC CTCATCGCCG CCCTTGAGGC CGCGGGCGGT 1920 

AAAGCACTTT ACGTGGCACC CACCAGAGAA CTGAGAGAGG CTATGGACCG GCGGATCAAA 1980 

CCGCCGTCCG CCTCGGCTAC GCAACATGTC GCCCTTGCGA TTCTCCGTCG TGCCACCGCC 204 0 

GAGGGCGCCC CTTTCGCTAC CGTGGTTATC GACGAGTGCT TCATGTTCCC GCTCGTGTAC 2100 

GTCGCGATCG TGCACGCCTT GTCCCCGAGC TCACGAATAG TCCTTGTAGG GGACGTCCAC 2160 

CAAATCGGGT TTATAGACTT CCAAGGCACA AGCGCGAACA TGCCGCTCGT TCGCGACGTC 2220 

GTTAAGCAGT GCCGTCGGCG CACTTTCAAC CAAACCAAGC GCTGTCCGGC CGACGTCGTT 2280 

GCCACCACGT TTTTCCAGAG CTTGTACCCC GGGTGCACAA CCACCTCAGG GTGCGTCGCA 234 0 

TCCATCAGCC ACGTCGCCCC AGACTACCGC AACAGCCAGG CGCAAACGCT CTGCTTCACG 24 00 

CAGGAGGAAA AGTCGCGCCA CGGGGCTGAG GGCGCGATGA CTGTGCACGA AGCGCAGGGA 24 60 

CGCACTTTTG CGTCTGTCAT TCTGCATTAC AACGGCTCCA CAGCAGAGCA GAAGCTCCTC 2520 

GCTGAGAAGT CGCACCTTCT AGTCGGCATC ACGCGCCACA CCAACCACCT GTACATCCGC 2580 

GACCCGACAG GTGACATTGA GAGACAACTC AACCATAGCG CGAAAGCCGA GGTGTTTACA 264 0 

GACATCCCTG CACCCCTGGA GATCACGACT GTCAAACCGA GTGAAGAGGT GCAGCGCAAC 2700 

GAAGTGATGG CAACGATACC CCCGCAGAGT GCCACGCCGC ACGGAGCAAT CCATCTGCTC 27 60 

CGCAAGAACT TCGGGGACCA ACCCGACTGT GGCTGTGTCG CTTTGGCGAA GACCGGCTAC 2820 

GAGGTGTTTG GCGGTCGTGC CAAAATCAAC GTAGAGCTTG CCGAACCCGA CGCGACCCCG 2880 

AAGCCGCATA GGGCGTTCCA GGAAGGGGTA CAGTGGGTCA AGGTCACCAA CGCGTCTAAC 2 94 0 

AAACACCAGG CGCTCCAGAC GCTGTTGTCC CGCTACACCA AGCGAAGCGC TGACCTGCCG 3000 

CTACACGAAG CTAAGGAGGA CGTCAAACGC ATGCTAAACT CGCTTGACCG ACATTGGGAC 3060 

TGGACTGTCA CTGAAGACGC CCGTGACCGA GCTGTCTTCG AGACCCAGCT CAAGTTCACC 3120 

CAACGCGGCG GCACCGTCGA AGACCTGCTG GAGCCAGACG ACCCCTACAT CCGTGACATA 3180 

GACTTCCTTA TGAAGACTCA GCAGAAAGTG TCGCCCAAGC CGATCAATAC GGGCAAGGTC 3240 

GGGCAGGGGA TCGCCGCTCA CTCAAAGTCT CTCAACTTCG TCCTCGCCGC TTGGATACGC 3300 

ATACTCGAGG AGATACTCCG TACCGGGAGC CGCACGGTCC GGTACAGCAA CGGTCTCCCC 3360 

GACGAAGAAG AGGCCATGCT GCTCGAAGCG AAGATCAATC AAGTCCCACA CGCCACGTTC 3420 

GTCTCGGCGG ACTGGACCGA GTTTGACACC GCCCACAATA ACACGAGTGA GCTGCTCTTC 3480 

GCCGCCCTTT TAGAGCGCAT CGGCACGCCT GCAGCTGCCG TTAATCTATT CAGAGAACGG 354 0 

TGTGGGAAAC GCACCTTGCG AGCGAAGGGT CTAGGCTCCG TTGAAGTCGA CGGTCTGCTC 3600 

GACTCCGGCG CAGCTTGGAC GCCTTGCCGC AACACCATCT TCTCTGCCGC CGTCATGCTC 3660 

ACGCTCTTCC GCGGCGTCAA GTTCGCAGCT TTCAAAGGCG ACGACTCGCT CCTCTGTGGT 3720 

AGCCATTACC TCCGTTTCGA CGCTAGCCGC CTTCACATGG GCGAACGTTA CAAGACCAAA 3780 

CATTTGAAGG TCGAGGTGCA GAAAATCGTG CCGTACATCG GACTCCTCGT CTCCGCTGAG 384 0 

CAGGTCGTCC TCGACCCTGT CAGGAGCGCT CTCAAGATAT TTGGGCGCTG CTACACAAGC 3900 

GAACTCCTTT ACTCCAAGTA CGTGGAGGCT GTGAGAGACA TCACCAAGGG CTGGAGTGAC 3960 

GCCCGCTACC ACAGCCTCCT GTGCCACATG TCAGCATGCT ACTACAATTA CGCGCCGGAG 4 020 

TCTGCGGCGT ACATCATCGA CGCTGTTGTT CGCTTTGGGC GCGGCGACTT CCCGTTTGAA 4 080 

CAACTGCGCG TGGTGCGTGC CCATGTGCAG GCACCCGACG CTTACAGCAG CACGTATCCG 414 0 

GCTAACGTGC GCGCATCGTG CCTTGACCAC GTCTTCGAGC CCCGCCAGGC CGCCGCCCCG 4 200 

GCAGGTTTCG TTGCGACATG TGCGAAGCCG GAAACGCCTT CTTCACTTAC CGCGAAAGCT 4260 

GGTGTTTCTG CGACTACAAG CCACGTTGCG ACTGGGACTG CGCCCCCGGA GTCTCCATGG 4320 

GATGCACCTG CAGCCAACAG CTTTTCGGAG TTATTGACAC CGGAGACCCC GTCCACATCA 4 380 

TCCTCGCCGT CATCGTCTTC ATCGGACTCC TCTACATCGT GTGGAAGGTC GCTCAGTGGT 4 44 0 

GGAGACACCG CAAGGACCAC AGAAGACTTG AACAGCAGAA AGCCGCCTTC GCAAGACAGG 4 500 

CAATCACGCT CGTCTGAATG TCTGGACAGA AGCGGAGAAA GGACAGGCAG TTCGTTAACT 4 560 
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GCCCCCACTG CTCCGAGCCC CTCATTCTCA TTTTCGGAAA GAGCTCGACT GGCGACCGGG 4 620 
CCGACTGTCG CCGCTGCGAC ATCACCTTCG GCAACCCCAT CCTGCGCCAC GGACCAGGTT 4 680 
GCCGCGAGGA CCACGCCGGA CTTTGCGCCT TTCCTGGGTT CCCAGTCTGC CCGTGCTGTC 474 0 
TCGAAGCCGT ACCGGCCCCC CACGACTGCC CGTTGGAAAG AAGTCACCCC GCTCCACGCG 4800 
TGGAAGGGCG TGACCGGAGA CCGACCGGAA GTCAGGGAGG ACCCGGAGAC AGCGGCGGTC 4860 
GTCCAGGCTC TGATCAGCGG CCGTTATCCT CAGAAGACGA AGCTTTCCTC CGACGCATCC 4 920 
AAAGGCTACT CAAGAACTAA GGG ATG CTC ACA ATC CAC CTC TTT TCC TGC 4 970 

Met Leu Thr lie His Leu Phe Ser Cys 
1 5 

CCC GAG TGC GGA TTA CCA GGC CCG CGA CTG CCA GAC AGT CCG AGT CTG 5018 
Pro Glu Cys Gly Leu Pro Gly Pro Arg Leu Pro Asp Ser Pro Ser Leu 
10 15 20 25 

CCG CGC CGC TGC AGA GAT GGC GCG CTC ATG TAT TCA CGA GCC GTT GGC 5066 
Pro Arg Arg Cys Arg Asp Gly Ala Leu Met Tyr Ser Arg Ala Val Gly 

30 35 40 

TTC ATC TGC CGC CAG TGC CGA CTT GAA GCG CAT ACG CTC TAC CTC GGA 5114 
Phe lie Cys Arg Gin Cys Arg Leu Glu Ala His Thr Leu Tyr Leu Gly 

45 50 55 

CTC TGT TCC CGA TGT AAA GAT CAG CAA GAG CGC ATG AAG GAA CAA AAT 5162 
Leu Cys Ser Arg Cys Lys Asp Gin Gin Glu Arg Met Lys "Glu Gin Asn 

60 65 70 

TAGTTTCCTT GTTCGTAAAC AAGGTGGTCC CTCCCATTGA GGTAAAGACT CTGGTGAGTC 5222 
CTCAACGTTA CTCGTTGAGT CTGCTGCGGT TCGATTCCAT TCCCAAGCAG CAAAGGGTGC 5282 
GCAACTAGTA CGGCGCCCCC TGGGATACCA 5312 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



Met 


Leu 


Thr 


He 


His 


Leu 


Phe 


Ser 


Cys 


Pro 


Glu 


Cys 


Gly 


Leu Pro 


Gly 


1 








5 










10 








15 




Pro 


Arg 


Leu 


Pro 


Asp 


Ser 


Pro 


Ser 


Leu 


Pro 


Arg 


Arg 


Cys 


Arg Asp 


Gly 








20 










25 










30 




Ala 


Leu 


Met 


Tyr 


Ser 


Arg 


Ala 


Val 


Gly 


Phe 


He 


Cys 


Arg 


Gin Cys 


Arg 






35 










40 










45 




Leu 


Glu 


Ala 


His 


Thr 


Leu 


Tyr 


Leu 


Gly 


Leu 


Cys 


Ser 


Arg 


Cys Lys 


Asp 




50 










55 










60 








Gin 


Gin 


Glu 


Arg 


Met 


Lys 


Glu 


Gin 


Asn 














65 










70 




















(2) 


INFORMATION 


FOR 


SEQ 


ID NO:47 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 283.. 753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

GTTTTTCTTT CTTTACCAAG TGTGGTAAAA TTTAAACAAA GAAGAAAACC AGGACCGTAA 60 
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CCCGGCCCTT ACACACCTCG AGTCCGTGAC CACCGGATTA TACGTCGCCC ACCACACGGC 120 
GCCTTTTCCG ACCACTCTCG AGAGTCGTTG GGAGTTTCGT CCGTGACCAC CCGGTTGGCA 180 
GTCGACAGAC GCTTCCGGAC CACTAGAACC TCCTCGAGCG ACGCACACAC AGCACACACA 24 0 

CCGCCTTAGC TGCACCTACG GCAGCGTTGA TAGCGCGGAT TT ATG AGC GAG CAC 294 

Met Ser Glu His 
1 

ACC ATC GCC CAC TCC ATC ACA TTA CCA CCC GGT TAC ACC CTT GCC CTA 342 
Thr He Ala His Ser He Thr Leu Pro Pro Gly Tyr Thr Leu Ala Leu 
5 10 15 20 

ATA CCC CCT GAA CCT GAA GCA GGA TGG GAG ATG CTG GAG TGG CGT CAC 390 
He Pro Pro Glu Pro Glu Ala Gly Trp Glu Met Leu Glu Trp Arg His 

25 30 35 

AGC GAC CTC ACA ACC GTC GCG GAA CCC GTA ACG TTC GGG TCA GCG CCA 4 38 

Ser Asp Leu Thr Thr Val Ala Glu Pro Val Thr Phe Gly Ser Ala Pro 

40 45 50 

ACA CCG TCA CCG TCA ATG GTA GAA GAA ACC AAC GGC GTC GGA CCG GAA 486 
Thr Pro Ser Pro Ser Met Val Glu Glu Thr Asn Gly Val Gly Pro Glu 

55 60 65 

GGC AAG TTT CTC CCC CTG ACA ATT TCA CCG CTG CTG CAC AAG ACC TCG 534 
Gly Lys Phe Leu Pro Leu Thr He Ser Pro Leu Leu His'Lys Thr Ser 

70 75 80 

CGC AAA GCC TTG ACG CCA ACA CCG TCA CTT TCC CCG CTA ACA TCT CTA 582 
Arg Lys Ala Leu Thr Pro Thr Pro Ser Leu Ser Pro Leu Thr Ser Leu 

85 90 95 100 ■ 

GCA TGC CCG AAT TCC GGA ATT GGG CCA AGG GAA AGA TCG ACC TCG ACT 630 
Ala Cys Pro Asn Ser Gly lie Gly Pro Arg Glu Arg Ser Thr Ser Thr 

105 110 115 

CCG ATT CCA TCG GCT GGT ACT TCA AGT ACC TTG ACC CAG CGG GTG CTA 678 
Pro He Pro Ser Ala Gly Thr Ser Ser Thr Leu Thr Gin Arg Val Leu 

120 125 130 

CAG AGT CTG CGC GCG CCG TCG GCG AGT ACT CGA AGA TCC CTG ACG GCC 72 6 

Gin Ser Leu Arg Ala Pro Ser Ala Ser Thr Arg Arg Ser Leu Thr Ala 

135 140 145 

TCG TCA AGT TCT CCG TCG ACG CAG AGA TAAGAGAGAT CTATAACGAG 773 
Ser Ser Ser Ser Pro Ser Thr Gin Arg 

150 155 
GAGTGCCCCG TCGTCACTGA CGTGTCCGTC CCCCTCGACG GCCGCCAGTG GAGCCTCTCG 833 
ATTTTCTCCT TTCCGATGTT CAGAACCGCC TACGTCGCCG TAGCGAACGT CGAGAACAAG 8 93 

GAGATGTCGC TCGACGTTGT CAACGACCTC ATCGAGTGGC TCAACAATCT CGCCGACTGG 953 
CGTTATGTCG TTGACTCTGA ACAGTGGATT AACTTCACCA ATGACACCAC GTACTACGTC 1013 
CGCATCCGCG TTCTACGTCC AAC C TAC GAC GTTCCAGACC CCACAGAGGG CCTTGTTCGC 1073 
ACAGTCTCAG ACTACCGCCT CAC T TAT AAG GC GAT AAC AT GTGAAGCCAA CATGCCAACA 1133 
CTCGTCGACC AAGGCTTTTG GATCGGCGGC CAGTACGCTC TCACCCCGAC TAGCCTACCG 1193 
CAG TAC GAC G TCAGCGAGGC CTACGCTCTG CACACTTTGA CCTTCGCCAG ACCATCCAGC 1253 
GCCGCTGCAC TCGCGTTTGT GTGGGCAGGT TTGCCACAGG GTGGCACTGC GCCTGCAGGC 1313 
ACTCCAGCCT GGGAGCAGGC ATCCTCGGGT GGCTACCTCA CCTGGCGCCA CAACGGTACT 1373 
ACTTTCCCAG CTGGCTCCGT TAGCTACGTT CTCCCTGAGG GTTTCGCCCT T GAG CGC TAC 14 33 
GACCCGAACG ACGGCTCTTG GACCGACTTC GCTTCCGCAG GAGACACCGT CACTTTCCGG 14 93 
CAGGTCGCCG TCGACGAGGT CGTTGTGACC AACAACCCCG CCGGCGGCGG CAGCGCCCCC 1553 
ACCTTCACCG TGAGAGTGCC CCCTTCAAAC GCT TAC ACC A ACACCGTGTT TAGGAACACG 1613 
CTCTTAGAGA CTCGACCCTC CTCTCGTAGG CTCGAACTCC CTATGCCACC TGCTGACTTT 1673 
GGACAGACGG TCGCCAACAA CCCGAAGATC GAGCAGTCGC TTCTTAAAGA AACACTTGGC 1733 
TGCTATTTGG TCCACTCCAA AATGCGAAAC CCCGTTTTCC AGCTCACGCC AGCCAGCTCC 17 93 
TTTGGCGCCG TTTCCTTCAA CAATCCGGGT TATGAGCGCA CACGCGACCT CCCGGACTAC * 1853 
ACTGGCATCC GTGACTCATT CGACCAGAAC ATGTCCACCG CTGTGGCCCA CTTCCGCTCA ' 1913 
CTCTCCCACT CCTGCAGTAT CGTCACTAAG ACCTACCAGG GTTGGGAAGG CGTCACGAAC 1973 
GTCAACACGC CTTTCGGCCA ATTCGCGCAC GCGGGCCTCC TCAAGAATGA GGAGATCCTC 2033 

27 



TGCCTCGCCG ACGACCTGGC CACCCGTCTC ACAGGTGTCT ACCCCGCCAC TGACAACTTC 2093 

GCGGCCGCCG TTTCTGCCTT CGCCGCGAAC ATGCTGTCCT CCGTGCTGAA GTCGGAGGCA 2153 

ACGTCCTCCA TCATCAAGTC CGTTGGCGAG ACTGCCGTCG GCGCGGCTCA GTCCGGCCTC 2213 

GCGAAGCTAC CCGGACTGCT AATGAGTGTA CCAGGGAAGA TTGCCGCGCG TGTCCGCGCG 2273 

CGCCGAGCGC GCCGCCGCGC CGCTCGTGCC AATTAGTTTG CTCGCTCCTG TTTCGCCGTT 2333 

TCGTAAAACG GCGTGGTCCC GCACATTACG CGTACCCTAA AGACTCTGGT GAGTCCCCGT 2393 

CGTTACACGA CGGGTCTGCC GCGGTTCGAT TCCATTCCCA AGCGGCAAGA AGGACGTAGT 24 53 

TAGCTCTGCG TCCCTCGGGA TACCA 2478 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



Met 


Ser 


Glu 


His 


Thr 


He 


Ala 


His 


Ser 


He 


Thr 


Leu 


Pro 


Pro 


Gly Tyr 


1 








5 










10 










15 




Thr 


Leu 


Ala 


Leu 


He 


Pro 


Pro 


Glu 


Pro 


Glu 


Ala 


Gly 


Trp 


Glu 


Met 


Leu 








20 










25 










30 






Glu 


Trp 


Arg 


His 


Ser 


Asp 


Leu 


Thr 


Thr 


Val 


Ala 


Glu 


Pro 


Val 


Thr 


Phe 






35 










40 










45 








Gly 


Ser 


Ala 


Pro 


Thr 


Pro 


Ser 


Pro 


Ser 


Met 


Val 


Glu 


Glu 


Thr 


Asn 


Gly 




50 










55' 










60 








Val 


Gly 


Pro 


Glu 


Gly 


Lys 


Phe 


Leu 


Pro 


Leu 


Thr 


He 


Ser 


Pro 


Leu 


Leu 


65 










70 










75 










80 


His 


Lys 


Thr 


Ser 


Arg 


Lys 


Ala 


Leu 


Thr 


Pro 


Thr 


Pro 


Ser 


Leu 


Ser 


Pro 










85 










90 










95 




Leu 


Thr 


Ser 


Leu 


Ala 


Cys 


Pro 


Asn 


Ser 


Gly 


He 


Gly 


Pro 


Arg 


Glu 


Arg 








100 










105 










110 






Ser 


Thr 


Ser 


Thr 


Pro 


He 


Pro 


Ser 


Ala 


Gly 


Thr 


Ser 


Ser 


Thr 


Leu 


Thr 






115 










120 










125 








Gin 


Arg 


Val 


Leu 


Gin 


Ser 


Leu 


Arg 


Ala 


Pro 


Ser 


Ala 


Ser 


Thr 


Arg 


Arg 




130 










135 










140 






Ser 


Leu 


Thr 


Ala 


Ser 


Ser 


Ser 


Ser 


Pro 


Ser 


Thr 


Gin 


Arg 








145 










150 










155 











(2) INFORMATION FOR SEQ ID NO: 49: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 366.. 2306 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

GTTTTTCTTT CTTTACCAAG TGTGGTAAAA TTTAAACAAA GAAGAAAACC AGGACCGTAA 60 

CCCGGCCCTT ACACACCTCG AGTCCGTGAC CACCGGATTA TACGTCGCCC ACCACACGGC 120 

GCCTTTTCCG ACCACTCTCG AGAGTCGTTG GGAGTTTCGT CCGTGACCAC CCGGTTGGCA 180 

GTCGACAGAC GCTTCCGGAC CACTAGAACC TCCTCGAGCG ACGCACACAC AGCACACACA 24 0 

CCGCCTTAGC TGCACCTACG GCAGCGTTGA TAGCGCGGAT TTATGAGCGA GCACACCATC 300 

GCCCACTCCA TCACATTACC ACCCGGTTAC ACCCTTGCCC TAATACCCCC TGAACCTGAA 360 



28 



GCAGG ATG GGA GAT GCT GGA GTG GCG TCA CAG CGA CCT CAC AAC CGT 407 
Met Gly Asp Ala Gly Val Ala Ser Gin Arg Pro His Asn Arg 
15 10 



CGC 


GGA 


ACC 


CGT 


AAC 


GTT 


CGG 


GTC 


AGC 


GCC 


AAC 


ACC 


GTC 


ACC 


GTC 


AAT 


455 


Arg 


Gly 


Thr 


Arg 


Asn 


Val 


Arg 


Val 


Ser 


Ala 


Asn 


Thr 


Val 


Thr 


Val 


Asn 




15 










20 










25 










30 




GGT 


AGA 


AGA 


AAC 


CAA 


CGG 


CGT 


CGG 


ACC 


GGA 


AGG 


CAA 


GTT 


TCT 


CCC 


CCT 


503 


Gly Arg 


Arg 


Asn 


Gin 


Arg 


Arg 


Arg 


Thr 


Gly 


Arg 


Gin 


Val 


Ser 


Pro 


Pro 












35 










40 










45 






GAC 


AAT 


TTC 


ACC 


GCT 


GCT 


GCA 


CAA 


GAC 


CTC 


GCG 


CAA 


AGC 


CTT 


GAC 


GCC 


551 


Asp Asn 


Phe 


Thr 


Ala 


Ala 


Ala 


Gin 


Asp 


Leu 


Ala 


Gin 


Ser 


Leu 


Asp 


Ala 










50 










55 










60 






AAC 


ACC 


GTC 


ACT 


TTC 


CCC 


GCT 


AAC 


ATC 


TCT 


AGC 


ATG 


CCC 


GAA 


TTC 


CGG 


599 


Asn 


Thr 


Val 


Thr 


Phe 


Pro 


Ala 


Asn 


He 


Ser 


Ser 


Met 


Pro 


Glu 


Phe 


Arg 








65 










70 










75 








AAT 


TGG 


GCC 


AAG 


GGA 


AAG 


ATC 


GAC 


CTC 


GAC 


TCC 


GAT 


TCC 


ATC 


GGC 


TGG 


647 


Asn 


Trp 
80 


Ala 


Lys 


Gly 


Lys 


He 
85 


Asp 


Leu 


Asp 


Ser 


Asp 
90 


Ser 


He 


Gly 


Trp 




TAC 


TTC 


AAG 


TAC 


CTT 


GAC 


CCA 


GCG 


GGT 


GCT 


ACA 


GAG 


TCT 


GCG 


CGC 


GCC 


695 


Tyr 


Phe 


Lys 


Tyr 


Leu 


Asp 


Pro 


Ala 


Gly 


Ala 


Thr 


Glu 


Ser Ala 


Arg 


Ala 




95 










100 










105 










110 




GTC 


GGC 


GAG 


TAC 


TCG 


AAG 


ATC 


CCT 


GAC 


GGC 


CTC 


GTC 


AAG 


TTC 


TCC 


GTC 


743 


Val 


Gly 


Glu 


Tyr 


Ser 
115 


Lys 


He 


Pro 


Asp 


Gly 
120 


Leu 


Val 


Lys 


Phe 


Ser 
125 


Val 




GAC 


GCA 


GAG 


ATA 


AGA 


GAG 


ATC 


TAT 


AAC 


GAG 


GAG 


TGC 


CCC 


GTC 


GTC 


ACT 


791 


Asp 


Ala 


Glu 


He 
130 


Arg 


Glu 


He 


Tyr 


Asn 
135 


Glu 


Glu 


Cys 


Pro 


Val 
140 


Val 


Thr 




GAC 


GTG 


TCC 


GTC 


CCC 


CTC 


GAC 


GGC 


CGC 


CAG 


TGG 


AGC 


CTC 


TCG 


ATT 


TTC 


839 


Asp 


Val 


Ser 
145 


Val 


Pro 


Leu 


Asp 


Gly 
150 


Arg 


Gin 


Trp 


Ser 


Leu 
155 


Ser 


He 


Phe 




TCC 


TTT 


CCG 


ATG 


TTC 


AGA 


ACC 


GCC 


TAC 


GTC 


GCC 


GTA 


GCG 


AAC 


GTC 


GAG 


887 


Ser 


Phe 
160 


Pro 


Met 


Phe 


Arg 


Thr 
165 


Ala 


Tyr 


Val 


Ala 


Val 
170 


Ala 


Asn 


Val 


Glu 




AAC 


AAG 


GAG 


ATG 


TCG 


CTC 


GAC 


GTT 


GTC 


AAC 


GAC 


CTC 


ATC 


GAG 


TGG 


CTC 


935 


Asn 


Lys 


Glu 


Met 


Ser 


Leu 


Asp 


Val 


Val 


Asn 


Asp 


Leu 


He 


Glu 


Trp 


Leu 




175 










180 










185 








190 




AAC 


AAT 


CTC 


GCC 


GAC 


TGG 


CGT 


TAT 


GTC 


GTT 


GAC 


TCT 


GAA 


CAG 


TGG 


ATT 


983 


Asn 


Asn 


Leu 


Ala 


Asp 
195 


Trp 


Arg 


Tyr 


Val 


Val 
200 


Asp 


Ser 


Glu 


Gin 


Trp 
205 


He 




AAC 


TTC 


ACC 


AAT 


GAC 


ACC 


ACG 


TAC 


TAC 


GTC 


CGC 


ATC 


CGC 


GTT 


CTA 


CGT 


1031 


Asn 


Phe 


Thr 


Asn 
210 


Asp 


Thr 


Thr 


Tyr 


Tyr 
215 


Val 


Arg 


He 


Arg 


Val 
220 


Leu 


Arg 




CCA 


ACC 


TAC 


GAC 


GTT 


CCA 


GAC 


CCC 


ACA 


GAG 


GGC 


CTT 


GTT 


CGC 


ACA 


GTC 


1079 


Pro 


Thr 


Tyr 


Asp 


Val 


Pro 


Asp 


Pro 


Thr 


Glu 


Gly 


Leu 


Val 


Arg 


Thr 


Val 








225 










230 










235 








TCA 


GAC 


TAC 


CGC 


CTC 


ACT 


TAT 


AAG 


GCG 


ATA 


ACA 


TGT 


GAA 


GCC 


AAC 


ATG 


1127 


Ser 


Asp 
240 


Tyr 


Arg 


Leu 


Thr 


Tyr 
245 


Lys 


Ala 


He 


Thr 


Cys 
250 


Glu 


Ala 


Asn 


Met 




CCA 


ACA 


CTC 


GTC 


GAC 


CAA 


GGC 


TTT 


TGG 


ATC 


GGC 


GGC 


CAG 


TAC 


GCT 


CTC 


1175 


Pro 


Thr 


Leu 


Val 


Asp 


Gin 


Gly 


Phe 


Trp 


He 


Gly 


Gly Gin 


Tyr 


Ala 


Leu 




255 










260 










265 








270 




ACC 


CCG 


ACT 


AGC 


CTA 


CCG 


CAG 


TAC 


GAC 


GTC 


AGC 


GAG 


GCC 


TAC 


GCT 


CTG 


1223 


Thr 


Pro 


Thr 


Ser 


Leu 


Pro 


Gin 


Tyr 


Asp 


Val 


Ser 


Glu 


Ala 


Tyr 


Ala 


Leu 












275 










280 








285 






CAC 


ACT 


TTG 


ACC 


TTC 


GCC 


AGA 


CCA 


TCC 


AGC 


GCC 


GCT 


GCA 


CTC 


GCG 


TTT 


1271 


His 


Thr 


Leu 


Thr 


Phe 


Ala 


Arg 


Pro 


Ser 


Ser 


Ala 


Ala 


Ala 


Leu 


Ala 


Phe 





290 295 300 



29 



GTG TGG 
Val Trp 

GCC TGG 
Ala Trp 
320 
GGT ACT 
Gly Thr 
335 

TTC GCC 
Phe Ala 

GCT TCC 
Ala Ser 

GTC GTT 
Val Val 



GCA GGT 
Ala Gly 
305 

GAG CAG 
Glu Gin 

ACT TTC 
Thr Phe 

CTT GAG 
Leu Glu 



TTG CCA CAG 
Leu Pro Gin 



ACC 
Thr 

AAC 
Asn 
415 
ATG 
Met 



GTG 
Val 
400 
ACG 
Thr 

CCA 
Pro 



GCA 
Ala 

GTG 
Val 
385 
AGA 
Arg 



GGA 
Gly 
370 
ACC 
Thr 

GTG 
Val 



GCA TCC 
Ala Ser 

CCA GCT 
Pro Ala 
340 
CGC TAC 
Arg Tyr 
355 

GAC ACC 
Asp Thr 



TCG 
Ser 
325 
GGC 
Gly 

GAC 
Asp 

GTC 
Val 



AAC AAC CCC 
Asn Asn Pro 



CCC CCT 
Pro Pro 



CTC TTA 
Leu Leu 

CCT GCT 
Pro Ala 



GAG CAG 
Glu Gin 

AAA ATG 
Lys Met 

GCC GTT 
Ala Val 
480 
GAC TAC 
Asp Tyr 
495 

GTG GCC 
Val Ala 

ACC TAC 
Thr Tyr 

CAA TTC 
Gin Phe 

GCC GAC 
Ala Asp 
560 
AAC TTC 
Asn Phe 
575 

GTG CTG 
Val Leu 



TCG 
Ser 

CGA 
Arg 
465 
TCC 
Ser 



CTT 
Leu 
450 
AAC 
Asn 

TTC 
Phe 



GAG 
Glu 

GAC 
Asp 
435 
CTT 
Leu 



ACT 
Thr 
420 
TTT 
Phe 

AAA 
Lys 



TCA 
Ser 
405 
CGA 
Arg 

GGA 
Gly 

GAA 
Glu 



GGT GGC ACT 
Gly Gly Thr 
310 

GGT GGC TAC 
Gly Gly Tyr 

TCC GTT AGC 
Ser Val Ser 

CCG AAC GAC 
Pro Asn Asp 
360 

ACT TTC CGG 
Thr Phe Arg 

375 
GCC GGC GGC 
Ala Gly Gly 
390 

AAC GCT TAC 
Asn Ala Tyr 

CCC TCC TCT 
Pro Ser Ser 



GCG CCT 
Ala Pro 

CTC ACC 
Leu Thr 
330 
TAC GTT 
Tyr Val 
345 

GGC TCT 
Gly Ser 

CAG GTC 
Gin Val 

GGC AGC 
Gly Ser 



GCA GGC ACT 
Ala Gly Thr 
315 

TGG CGC CAC 
Trp Arg His 

CTC CCT GAG 
Leu Pro Glu 



ACT GGC 
Thr Gly 

CAC TTC 
His Phe 

CAG GGT 
Gin Gly 
530 
GCG CAC 
Ala His 
545 

GAC CTG 
Asp Leu 

GCG GCC 
Ala Ala 

AAG TCG 
Lys Ser 



CCC GTT TTC 
Pro Val Phe 

AAC AAT CCG 
Asn Asn Pro 
485 

ATC CGT GAC 
lie Arg Asp 

500 
CGC TCA CTC 
Arg Ser Leu 
515 

TGG GAA GGC 
Trp Glu Gly 

GCG GGC CTC 
Ala Gly Leu 



CAG 
Gin 

ACA 
Thr 

CAG 
Gin 
470 
GGT 
Gly 



ACG 
Thr 

CTT 
Leu 
455 
CTC 
Leu 



GTC 
Val 
440 
GGC 
Gly 

ACG 
Thr 



ACC 
Thr 

CGT 
Arg 
425 
GCC 
Ala 



AAC 
Asn 
410 
AGG 
Arg 

AAC 
Asn 



TGG 
Trp 

GCC 
Ala 

GCC 
Ala 
395 
ACC 
Thr ' 



ACC GAC 
Thr Asp 
365 
GTC GAC 
Val Asp 
380 

CCC ACC 
Pro Thr 

GTG TTT 
Val Phe 



CTC GAA CTC 
Leu Glu Leu 



AAC CCG 
Asn Pro 



TAT GAG 
Tyr Glu 



GCC 
Ala 

GCC 
Ala 

GAG 
Glu 
595 



ACC CGT 
Thr Arg 
565 
GTT TCT 
Val Ser 
580 

GCA ACG 
Ala Thr 



TCA TTC GAC 
Ser Phe Asp 

TCC CAC TCC 
Ser His Ser 
520 

GTC ACG AAC 
Val Thr Asn 

535 
CTC AAG AAT 
Leu Lys Asn 
550 

CTC ACA GGT 
Leu Thr Gly 

GCC TTC GCC 
Ala Phe Ala 

TCC TCC ATC 
Ser Ser He 
. 600 



TGC TAT 
Cys Tyr 

CCA GCC 
Pro Ala 

CGC ACA 
Arg Thr 
490 
CAG AAC 
Gin Asn 
505 

TGC AGT 
Cys Ser 

GTC AAC 
Val Asn 

GAG GAG 
Glu Glu 

GTC TAC 
Val Tyr 
570 
GCG AAC 
Ala Asn 
585 

ATC AAG 
He Lys 



TTG 
Leu 

AGC 
Ser 
475 
CGC 
Arg 



GTC 
Val 
460 
TCC 
Ser 

GAC 
Asp 



AAG 
Lys 
445 
CAC 
His 

TTT 
Phe 

CTC 
Leu 



ATG TCC ACC 
Met Ser Thr 

ATC GTC ACT 
He Val Thr 
525 

ACG CCT TTC 
Thr Pro Phe 

540 
ATC CTC TGC 
He Leu Cys 
555 

CCC GCC ACT 
Pro Ala Thr 

ATG CTG TCC 
Met Leu Ser 

TCC GTT GGC 
Ser Val Gly 
605 



CCA 
Pro 

AAC 
Asn 

GGT 
Gly 
350 
TTC 
Phe 

GAG 
Glu 

TTC 
Phe 

AGG 
Arg 

CCT 
Pro 
430 
ATC 
He 

TCC 
Ser 

GGC 
Gly 

CCG 
Pro 

GCT 
Ala 
510 
AAG 
Lys 

GGC 
Gly 

CTC 
Leu 

GAC 
Asp 

TCC 
Ser 
590 
GAG 
Glu 



1319 



1367 



1415 



1463 



1511 



1559 



1607 



1655 



1703 



1751 



1799 



1847 



1895 



1943 



1991 



2039 



2087 



2135 



2183 
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ACT GCC GTC GGC GCG GCT CAG TCC GGC CTC GCG AAG CTA CCC GGA CTG 2231 
Thr Ala Val Gly Ala Ala Gin Ser Gly Leu Ala Lys Leu Pro Gly Leu 

610 615 620 

CTA ATG AGT GTA CCA GGG AAG ATT GCC GCG CGT GTC CGC GCG CGC CGA 2279 
Leu Met Ser Val Pro Gly Lys lie Ala Ala Arg Val Arg Ala Arg Arg 

625 630 635 

GCG CGC CGC CGC GCC GCT CGT GCC AAT TAGTTTGCTC GCTCCTGTTT 2326 
Ala Arg Arg Arg Ala Ala Arg Ala Asn 

640 645 
CGCCGTTTCG TAAAACGGCG TGGTCCCGCA CATTACGCGT ACCCTAAAGA CTCTGGTGAG 2386 
TCCCCGTCGT TACACGACGG GTCTGCCGCG GTTCGATTCC ATTCCCAAGC GGCAAGAAGG 24 4 6 
ACGTAGTTAG CTCTGCGTCC CTCGGGATAC CA 24 78 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 647 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



Met 


Gly 


Asp 


Ala 


Gly 


Val 


Ala 


Ser 


Gin 


Arg 


Pro 


His 


Asn 


Arg Arg 


Gly 


1 








5 










10 










15 




Thr 


Arg 


Asn 


Val 


Arg 


Val 


Ser 


Ala 


Asn 


Thr 


Val 


Thr 


Val 


Asn 


Gly Arg 








20 










25 










30 






Arg 


Asn 


Gin 


Arg 


Arg 


Arg 


Thr 


Gly 


Arg 


Gin 


Val 


Ser 


Pro 


Pro Asp 


Asn 






35 










40 










45 








Phe 


Thr 


Ala 


Ala 


Ala 


Gin 


Asp 


Leu 


Ala 


Gin 


Ser 


Leu 


Asp 


Ala 


Asn 


Thr 




50 










55 










60 








Val 


Thr 


Phe 


Pro 


Ala 


Asn 


He 


Ser 


Ser 


Met 


Pro 


Glu 


Phe 


Arg Asn 


Trp 


65 










70 










75 










80 


Ala 


Lys 


Gly 


Lys 


He 
85 


Asp 


Leu 


Asp 


Ser 


Asp 
90 


Ser 


He 


Gly 


Trp 


Tyr 
95 


Phe 


Lys 


Tyr 


Leu 


Asp 


Pro 


Ala 


Gly 


Ala 


Thr 


Glu 


Ser 


Ala 


Arg 


Ala 


Val 


Gly 








100 










105 








110 






Glu 


Tyr 


Ser 


Lys 


He 


Pro 


Asp 


Gly 


Leu 


Val 


Lys 


Phe 


Ser 


Val 


Asp 


Ala 






115 










120 










125 






Glu 


He 


Arg 


Glu 


He 


Tyr 


Asn 


Glu 


Glu 


Cys 


Pro 


Val 


Val 


Thr 


Asp 


Val 




130 










135 










140 








Ser 


Val 


Pro 


Leu 


Asp 


Gly 


Arg 


Gin 


Trp 


Ser 


Leu 


Ser 


He 


Phe 


Ser 


Phe 


145 










150 










155 










160 


Pro 


Met 


Phe 


Arg 


Thr 


Ala 


Tyr 


Val 


Ala 


Val 


Ala 


Asn 


Val 


Glu 


Asn 


Lys 










165 










170 










175 


Glu 


Met 


Ser 


Leu 


Asp 


Val 


Val 


Asn 


Asp 


Leu 


He 


Glu 


Trp 


Leu 


Asn 


Asn 








180 










185 








190 






Leu 


Ala 


Asp 
195 


Trp 


Arg 


Tyr 


Val 


Val 
200 


Asp 


Ser 


Glu 


Gin 


Trp 
205 


He 


Asn 


Phe 


Thr 


Asn 


Asp 


Thr 


Thr 


Tyr 


Tyr 


Val 


Arg 


He 


Arg 


Val 


Leu 


Arg 


Pro 


Thr 




210 










215 










220 








Tyr 


Asp 


Val 


Pro 


Asp 


Pro 


Thr 


Glu 


Gly 


Leu 


Val 


Arg 


Thr 


Val 


Ser 


Asp 


225 










230 










235 








240 


Tyr 


Arg 


Leu 


Thr 


Tyr 
245 


Lys 


Ala 


He 


Thr 


Cys 
250 


Glu 


Ala 


Asn 


Met 


Pro 
255 


Thr 


Leu 


Val 


Asp 


Gin 
260 


Gly 


Phe 


Trp 


lie 


Gly 
265 


Gly 


Gin 


Tyr 


Ala 


Leu 
270 


Thr 


Pro 


Thr 


Ser 


Leu 


Pro 


Gin 


Tyr 


Asp 


Val 


Ser 


Glu 


Ala 


Tyr 


Ala 


Leu 


His 


Thr 






275 










280 








285 
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Leu Thr Phe Ala 
290 

Ala Gly Leu Pro 
305 

Glu Gin Ala Ser 

Thr Phe Pro Ala 
340 

Leu Glu Arg Tyr 
355 

Ala Gly Asp Thr 
370 

Val Thr Asn Asn 
385 

Arg Val Pro Pro 

Leu Leu Glu Thr 
420 

Pro Ala Asp Phe 
435 

Ser Leu Leu Lys 
450 

Arg Asn Pro Val 
465 

Ser Phe Asn Asn 

Thr Gly He Arg 
500 

His Phe Arg Ser 
515 

Gin Gly Trp Glu 
530 

Ala His Ala Gly 
545 

Asp Leu Ala Thr 

Ala Ala Ala Val 
580 

Lys Ser Glu Ala 
595 

Val Gly Ala Ala 
610 

Ser Val Pro Gly 
625 

Arg Arg Ala Ala 



Arg Pro Ser Ser 
295 

Gin Gly Gly Thr 
310 

Ser Gly Gly Tyr 
325 

Gly Ser Val Ser 

Asp Pro Asn Asp 
360 

Val Thr Phe Arg 
375 

Pro Ala Gly Gly 
390 

Ser Asn Ala Tyr 
405 

Arg Pro Ser Ser 

Gly Gin Thr Val 
440 

Glu Thr Leu Gly 
455 

Phe Gin Leu Thr 
470 

Pro Gly Tyr Glu 
485 

Asp Ser Phe Asp 

Leu Ser His Ser 
520 

Gly Val Thr Asn 
535 

Leu Leu Lys Asn 
550 

Arg Leu Thr Gly 
565 

Ser Ala Phe Ala 

Thr Ser Ser He 
600 

Gin Ser Gly Leu 
615 

Lys He Ala Ala 
630 

Arg Ala Asn 
645 



Ala Ala Ala Leu 
300 

Ala Pro Ala Gly 
315 

Leu Thr Trp Arg 
330 

Tyr Val Leu Pro 
345 

Gly Ser Trp Thr 

Gin Val Ala Val 
380 

Gly Ser Ala Pro 
395 

Thr Asn Thr Val 
410 

Arg Arg Leu Glu 
425 

Ala Asn Asn Pro 

Cys Tyr Leu Val 
460 

Pro Ala Ser Ser 
475 

Arg Thr Arg Asp 
490 

Gin Asn Met Ser 
505 

Cys Ser He Val 

Val Asn Thr Pro 
540 

Glu Glu He Leu 
555 

Val Tyr Pro Ala 
570 

Ala Asn Met Leu 
585 

He Lys Ser Val 

Ala Lys Leu Pro 
620 

Arg Val Arg Ala 
635 



Ala Phe Val Trp 

Thr Pro Ala Trp 
320 

His Asn Gly Thr 
335 

Glu Gly Phe Ala 
350 

Asp Phe Ala Ser 
365 

Asp Glu Val Val 

Thr Phe Thr Val 
400 

Phe Arg Asn Thr 
415 

Leu Pro Met Pro 
430 

Lys He Glu Gin 
445 

His Ser Lys Met 

Phe Gly Ala Val 
480 

Leu Pro Asp Tyr 
495 

Thr Ala Val Ala 
510 

Thr Lys Thr Tyr 
525 

Phe Gly Gin Phe 

Cys Leu Ala Asp 
560 

Thr Asp Asn Phe 
575 

Ser Ser Val Leu 
590 

Gly Glu Thr Ala 
605 

Gly Leu Leu Met 

Arg Arg Ala Arg 
640 



(2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 
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(B) LOCATION: 283.. 2307 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

GTTTTTCTTT CTTTACCAAG TGTGGTAAAA TTTAAACAAA GAAGAAAACC AGGACCGTAA 60 

CCCGGCCCTT ACACACCTCG AGTCCGTGAC CACCGGATTA TACGTCGCCC ACCACACGGC 120 

GCCTTTTCCG ACCACTCTCG AGAGTCGTTG GGAGTTTCGT CCGTGACCAC CCGGTTGGCA 180 

GTCGACAGAC GCTTCCGGAC CACTAGAACC TCCTCGAGCG ACGCACACAC AGCACACACA 240 

CCGCCTTAGC TGCACCTACG GCAGCGTTGA TAGCGCGGAT TT ATG AGC GAG CAC 294 



Met Ser Glu His 
1 



ACC 


ATC 


GCC 


CAC 


TCC 


ATC 


ACA 


TTA 


CCA 


CCC 


GGT 


TAC 


ACC 


CTT 


GCC 


CTA 


342 


Thr 


He 


Ala 


His 


Ser 


He 


Thr 


Leu 


Pro 


Pro 


Gly Tyr 


Thr 


Leu 


Ala 


Leu 




5 










10 










15 










20 




ATA 


ccc 


CCT 


GAA 


CCT 


GAA 


GCA 


GGA 


TGG 


GAG 


ATG 


CTG 


GAG 


TGG 


CGT 


CAC 


390 


He 


Pro 


Pro 


Glu 


Pro 
25 


Glu 


Ala 


Gly 


Trp 


Glu 
30 


Met 


Leu 


Glu 


Trp 


Arg 
35 


His 




AGC 


GAC 


CTC 


ACA 


ACC 


GTC 


GCG 


GAA CCC 


GTA 


ACG 


TTC 


GGG 


TCA 


GCG 


CCA 


438 


Ser 


Asp 


Leu 


Thr 


Thr 


Val 


Ala 


Glu 


Pro 


Val 


Thr 


Phe 


Gly Ser 


Ala 


Pro 










40 










45 










. 50 








ACA 


CCG 


TCA 


CCG 


TCA 


ATG 


GTA 


GAA 


GAA 


ACC 


AAC 


GGC 


GTC 


GGA 


CCG 


GAA 


486 


Thr 


Pro 


Ser 


Pro 


Ser 


Met 


Val 


Glu 


Glu 


Thr 


Asn 


Gly Val 


Gly 


Pro 


Glu 








55 










60 










65 










GGC 


AAG 


TTT 


CTC 


CCC 


CTG 


ACA 


ATT 


TCA 


CCG 


CTG 


CTG 


CAC 


AAG 


ACC 


TCG 


534 


Gly 


Lys 
70 


Phe 


Leu 


Pro 


Leu 


Thr 
75 


He 


Ser 


Pro 


Leu 


Leu 
80 


His 


Lys 


Thr 


Ser 




CGC 


AAA 


GCC 


TTG 


ACG 


CCA 


ACA- 


CCG 


TCA 


CTT 


TCC 


CCC 


GCT 


AAC 


ATC 


TCT 


582 


Arg 


Lys 


Ala 


Leu 


Thr 


Pro 


Thr 


Pro 


Ser 


Leu 


Ser 


Pro 


Ala 


Asn 


He 


Ser 




85 










90 










95 










100 




AGC 


ATG 


CCC 


GAA 


TTC 


CGG 


AAT 


TGG 


GCC 


AAG 


GGA 


AAG 


ATC 


GAC 


CTC 


GAC 


630 


Ser 


Met 


Pro 


Glu 


Phe 


Arg 


Asn 


Trp Ala 


Lys 


Gly 


Lys 


He 


Asp 


Leu 


Asp 












105 










110 










115 






TCC 


GAT 


TCC 


ATC 


GGC 


TGG 


TAC 


TTC 


AAG 


TAC 


CTT 


GAC 


CCA 


GCG 


GGT 


GCT 


678 


Ser 


Asp 


Ser 


He 


Gly 


Trp 


Tyr 


Phe 


Lys 


Tyr 


Leu 


Asp 


Pro 


Ala 


Gly Ala 










120 










125 










130 








ACA 


GAG 


TCT 


GCG 


CGC 


GCC 


GTC 


GGC 


GAG 


TAC 


TCG 


AAG 


ATC 


CCT 


GAC 


GGC 


726 


Thr 


Glu 


Ser 


Ala 


Arg 


Ala 


Val 


Gly Glu 


Tyr 


Ser 


Lys 


He 


Pro 


Asp 


Gly 








135 










140 










145 










CTC 


GTC 


AAG 


TTC 


TCC 


GTC 


GAC 


GCA 


GAG 


ATA 


AGA 


GAG 


ATC 


TAT 


AAC 


GAG 


774 


Leu 


Val 


Lys 


Phe 


Ser 


Val 


Asp 


Ala 


Glu 


He 


Arg 


Glu 


He 


Tyr 


Asn 


Glu 






150 










155 










160 










GAG 


TGC 


CCC 


GTC 


GTC 


ACT 


GAC 


GTG 


TCC 


GTC 


CCC 


CTC 


GAC 


GGC 


CGC 


CAG 


822 


Glu 


Cys 


Pro 


Val 


Val 


Thr 


Asp 


Val 


Ser 


Val 


Pro 


Leu 


Asp Gly Arg 


Gin 




165 










170 










175 










180 




TGG 


AGC 


CTC 


TCG 


ATT 


TTC 


TCC 


TTT 


CCG 


ATG 


TTC 


AGA 


ACC 


GCC 


TAC 


GTC 


870 


Trp 


Ser 


Leu 


Ser 


He 
185 


Phe 


Ser 


Phe 


Pro 


Met 
190 


Phe 


Arg 


Thr 


Ala 


Tyr 
195 


Val 




GCC 


GTA 


GCG 


AAC 


GTC 


GAG 


AAC 


AAG 


GAG 


ATG 


TCG 


CTC 


GAC 


GTT 


GTC 


AAC 


918 


Ala 


Val 


Ala 


Asn 


Val 


Glu 


Asn 


Lys 


Glu 


Met 


Ser 


Leu 


Asp 


Val 


Val 


Asn 










200 










205 








210 








GAC 


CTC 


ATC 


GAG 


TGG 


CTC 


AAC 


AAT 


CTC 


GCC 


GAC 


TGG 


CGT 


TAT 


GTC 


GTT 


966 


Asp 


Leu 


He 


Glu 


Trp 


Leu 


Asn 


Asn 


Leu 


Ala 


Asp 


Trp Arg 


Tyr 


Val 


Val 








215 










220 










225 








GAC 


TCT 


GAA 


CAG 


TGG 


ATT 


AAC 


TTC 


ACC 


AAT 


GAC 


ACC 


ACG 


TAC 


TAC 


GTC 


1014 


Asp 


Ser 


Glu 


Gin 


Trp 


He 


Asn 


Phe 


Thr 


Asn 


Asp 


Thr 


Thr 


Tyr 


Tyr 


Val 






230 










235 










240 








CGC 


ATC 


CGC 


GTT 


CTA 


CGT 


CCA 


ACC 


TAC 


GAC 


GTT 


CCA 


GAC 


CCC 


ACA 


GAG 


1062 


Arg 


He 


Arg 


Val 


Leu 


Arg 


Pro 


Thr 


Tyr 


Asp 


Val 


Pro 


Asp 


Pro 


Thr 


Glu 
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245 250 255 260 

GGC CTT GTT CGC ACA GTC TCA GAC TAC CGC CTC ACT TAT AAG GCG ATA 1110 

Gly Leu Val Arg Thr Val Ser Asp Tyr Arg Leu Thr Tyr Lys Ala He 

265 270 275 

ACA TGT GAA GCC AAC ATG CCA ACA CTC GTC GAC CAA GGC TTT TGG ATC 1158 
Thr Cys Glu Ala Asn Met Pro Thr Leu Val Asp Gin Gly Phe Trp He 

280 285 290 

GGC GGC CAG TAC GCT CTC ACC CCG ACT AGC CTA CCG CAG TAC GAC GTC 1206 
Gly Gly Gin Tyr Ala Leu Thr Pro Thr Ser Leu Pro Gin Tyr Asp Val 

295 300 305 

AGC GAG GCC TAC GCT CTG CAC ACT TTG ACC TTC GCC AGA CCA TCC AGC 1254 
Ser Glu Ala Tyr Ala Leu His Thr Leu Thr Phe Ala Arg Pro Ser Ser 

310 315 320 

GCC GCT GCA CTC GCG TTT GTG TGG GCA GGT TTG CCA CAG GGT GGC ACT 1302 
Ala Ala Ala Leu Ala Phe Val Trp Ala Gly Leu Pro Gin Gly Gly Thr 
325 330 335 340 

GCG CCT GCA GGC ACT CCA GCC TGG GAG CAG GCA TCC TCG GGT GGC TAC 1350 
Ala Pro Ala Gly Thr Pro Ala Trp Glu Gin Ala Ser Ser Gly Gly Tyr 

345 350 355 

CTC ACC TGG CGC CAC AAC GGT ACT ACT TTC CCA GCT GGC 'TCC GTT AGC 1398 
Leu Thr Trp Arg His Asn Gly Thr Thr Phe Pro Ala Gly Ser Val Ser 

360 365 370 

TAC GTT CTC CCT GAG GGT TTC GCC CTT GAG CGC TAC GAC CCG AAC GAC 14 4 6 

Tyr Val Leu Pro Glu Gly Phe Ala Leu Glu Arg Tyr Asp Pro Asn Asp 

375 380 385 

GGC TCT TGG ACC GAC TTC GCT- TCC GCA GGA GAC ACC GTC ACT TTC CGG 14 94 

Gly Ser Trp Thr Asp Phe Ala Ser Ala Gly Asp Thr Val Thr Phe Arg 

390 395 400 

CAG GTC GCC GTC GAC GAG GTC GTT GTG ACC AAC AAC CCC GCC GGC GGC 1542 
Gin Val Ala Val Asp Glu Val Val Val Thr Asn Asn Pro Ala Gly Gly 
405 410 415 420 

GGC AGC GCC. CCC ACC TTC ACC GTG AGA GTG CCC CCT TCA AAC GCT TAC 1590 
Gly Ser Ala Pro Thr Phe Thr Val Arg Val Pro Pro Ser Asn Ala Tyr 

425 430 435 

ACC AAC ACC GTG TTT AGG AAC ACG CTC TTA GAG ACT CGA CCC TCC TCT 1638 
Thr Asn Thr : Val Phe Arg Asn Thr Leu Leu Glu Thr Arg Pro Ser Ser 

440 445 450 

CGT AGG CTC GAA CTC CCT ATG CCA CCT GCT GAC TTT GGA CAG ACG GTC 1686 
Arg Arg Leu Glu Leu Pro Met Pro Pro Ala Asp Phe Gly Gin Thr Val 

455 460 465 

GCC AAC AAC CCG AAG ATC GAG CAG TCG CTT CTT AAA GAA ACA CTT GGC 1734 
Ala Asn Asn Pro Lys He Glu Gin Ser Leu Leu Lys Glu Thr Leu Gly 

470 475 480 

TGC TAT TTG GTC CAC TCC AAA ATG CGA AAC CCC GTT TTC CAG CTC ACG 1782 
Cys Tyr Leu Val His Ser Lys Met Arg Asn Pro Val Phe Gin Leu Thr 
485 490 495 500 

CCA GCC AGC TCC TTT GGC GCC GTT TCC TTC AAC AAT CCG GGT TAT GAG 1830 
Pro Ala Ser Ser Phe Gly Ala Val Ser Phe Asn Asn Pro Gly Tyr Glu 

505 510 515 

CGC ACA CGC GAC CTC CCG GAC TAC ACT GGC ATC CGT GAC TCA TTC GAC 1878 
Arg Thr Arg Asp Leu Pro Asp Tyr Thr Gly He Arg Asp Ser Phe Asp 

520 525 530 

CAG AAC ATG TCC ACC GCT GTG GCC CAC TTC CGC TCA CTC TCC CAC TCC 1926 
Gin Asn Met Ser Thr Ala Val Ala His Phe Arg Ser Leu Ser His Ser 

535 540 545 

TGC AGT ATC GTC ACT AAG ACC TAC CAG GGT TGG GAA GGC GTC ACG AAC 1974 
Cys Ser lie Val Thr Lys Thr Tyr Gin Gly Trp Glu Gly Val Thr Asn 
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550 555 560 

GTC AAC ACG CCT TTC GGC CAA TTC GCG CAC GCG GGC CTC CTC AAG AAT 2022 
Val Asn Thr Pro Phe Gly Gin Phe Ala His Ala Gly Leu Leu Lys Asn 
565 570 575 580 

GAG GAG ATC CTC TGC CTC GCC GAC GAC CTG GCC ACC CGT CTC ACA GGT 2070 
Glu Glu lie Leu Cys Leu Ala Asp Asp Leu Ala Thr Arg Leu Thr Gly 

585 590 595 

GTC TAC CCC GCC ACT GAC AAC TTC GCG GCC GCC GTT TCT GCC TTC GCC 2118 
Val Tyr Pro Ala Thr Asp Asn Phe Ala Ala Ala Val Ser Ala Phe Ala 

600 605 610 

GCG AAC ATG CTG TCC TCC GTG CTG AAG TCG GAG GCA ACG TCC TCC ATC 2166 
Ala Asn Met Leu Ser Ser Val Leu Lys Ser Glu Ala Thr Ser Ser lie 

615 620 625 

ATC AAG TCC GTT GGC GAG ACT GCC GTC GGC GCG GCT CAG TCC GGC CTC 2214 
He Lys Ser Val Gly Glu Thr Ala Val Gly Ala Ala Gin Ser Gly Leu 

630 635 640 

GCG AAG CTA CCC GGA CTG CTA ATG AGT GTA CCA GGG AAG ATT GCC GCG 2262 
Ala Lys Leu Pro Gly Leu Leu Met Ser Val Pro Gly Lys He Ala Ala 
645 650 655 660 

CGT GTC CGC GCG CGC CGA GCG CGC CGC CGC GCC GCT CGT GCC AAT 2307 
Arg Val Arg Ala Arg Arg Ala Arg Arg Arg Ala Ala Arg Ala Asn 
665 670 675 

TAGTTTGCTC GCTCCTGTTT CGCCGTTTCG TAAAACGGCG TGGTCCCGCA CATTACGCGT 2367 
ACCCTAAAGA CTCTGGTGAG TCCCCGTCGT TACACGACGG GTCTGCCGCG GTTCGATTCC 2427 
ATTCCCAAGC GGCAAGAAGG ACGTAGTTAG CTCTGCGTCC CTCGGGATAC CA 24 7 9 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

Met Ser Glu His Thr He Ala His Ser He Thr Leu Pro Pro Gly Tyr 

15 10 15 

Thr Leu Ala Leu He Pro Pro Glu Pro Glu Ala Gly Trp Glu Met Leu 

20 25 30 

Glu Trp Arg His Ser Asp Leu Thr Thr Val Ala Glu Pro Val Thr Phe 

35 40 45 

Gly Ser Ala Pro Thr Pro Ser Pro Ser Met Val Glu Glu Thr Asn Gly 

50 55 60 

Val Gly Pro Glu Gly Lys Phe Leu Pro Leu Thr He Ser Pro Leu Leu 
65 70 75 80 

His Lys Thr Ser Arg Lys Ala Leu Thr Pro Thr Pro Ser Leu Ser Pro 

85 90 95 

Ala Asn lie Ser Ser Met Pro Glu Phe Arg Asn Trp Ala- Lys Gly Lys 

100 105 110 

He Asp Leu Asp Ser Asp Ser He Gly Trp Tyr Phe Lys Tyr Leu Asp 

115 120. 125 

Pro Ala Gly Ala Thr Glu Ser Ala Arg Ala Val Gly Glu Tyr Ser Lys 

130 135 140 

He Pro Asp Gly Leu Val Lys Phe Ser Val Asp Ala Glu He Arg Glu 
145 150 155 160 

He Tyr Asn Glu Glu Cys Pro Val Val Thr Asp Val Ser Val Pro Leu 

165 170 175 

Asp Gly Arg Gin Trp Ser Leu Ser He Phe Ser Phe Pro Met Phe Arg 
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Thr Ala 

Asp Val 
210 
Arg Tyr 
225 

Thr Tyr 

Asp Pro 

Tyr Lys 

Gly Phe 
290 
Gin Tyr 
305 

Arg Pro 

Gin Gly 

Ser Gly 

Gly Ser 
370 
Asp Pro 
385 

Val Thr 

Pro Ala 

Ser Asn 

Arg Pro 
450 
Gly Gin 
465 

Glu Thr 

Phe Gin 

Pro Gly 

Asp Ser 
530 
Leu Ser 
545 

Gly Val 

Leu Leu 

Arg Leu 

Ser Ala 
610 
Thr Ser 
625 



180 
Tyr Val Ala 
195 

Val Asn Asp 

Val Val Asp 

Tyr Val Arg 
245 

Thr Glu Gly 
260 

Ala He Thr 
275 

Trp He Gly 

Asp Val Ser 

Ser Ser Ala 
325 

Gly Thr Ala 

340 
Gly Tyr Leu 
355 

Val Ser Tyr 

Asn Asp Gly 

Phe Arg Gin 
405 

Gly Gly Gly 
420 

Ala Tyr Thr 
435 

Ser Ser Arg 

Thr Val Ala 

Leu Gly Cys 
485 

Leu Thr Pro 

500 
Tyr Glu Arg 
515 

Phe Asp Gin 

His Ser Cys 

Thr Asn Val 
565 

Lys Asn Glu 

580 
Thr Gly Val 
595 

Phe Ala Ala 



Val Ala 

Leu He 
215 
Ser Glu 
230 

He Arg 

Leu Val 

Cys Glu 

Gly Gin 
295 
Glu Ala 
310 

Ala Ala 

Pro Ala 

Thr Trp 

Val Leu 
375 
Ser Trp 
390 

Val Ala 

Ser Ala 

Asn Thr 

Arg Leu 
455 
Asn Asn 
470 

Tyr Leu 

Ala Ser 

Thr Arg 

Asn Met 
535 
Ser He 
550 

Asn Thr 
Glu He 
Tyr Pro 



185 
Asn Val 
200 

Glu Trp. 

Gin Trp 

Val Leu 

Arg Thr 
265 
Ala Asn 
280 

Tyr Ala 

Tyr Ala 

Leu Ala 

Gly Thr 
345 
Arg His 
360 

Pro Glu 

Thr Asp 

Val Asp 

Pro Thr 
425 
Val Phe 
440 

Glu Leu 

Pro Lys 

Val His 

Ser Phe 
505 
Asp Leu 
520 

Ser Thr 
Val Thr 
Pro Phe 



Ser He He 



Asn Met 
615 
Lys Ser 
630 



Leu Cys 
585 
Ala Thr 
600 

Leu Ser 
Val Gly 



Glu Asn 

Leu Asn 

lie Asn 
235 
Arg Pro 
250 

Val Ser 

Met Pro 

Leu Thr 

Leu His 
315 
Phe Val 
330 

Pro Ala 

Asn Gly 

Gly Phe 

Phe Ala 
395 
Glu Val 
410 

Phe Thr 

Arg Asn 

Pro Met 

He Glu 
475 
Ser Lys 
490 

Gly Ala 

Pro Asp 

Ala Val 

Lys Thr 
555 
Gly Gin 
570 

Leu Ala 



Lys Glu 
205 
Asn Leu 
220 

Phe Thr 

Thr Tyr 

Asp Tyr 

Thr Leu 
285 
Pro Thr 
300 

Thr Leu 

Trp Ala 

Trp Glu 

Thr Thr 
365 
Ala Leu 
380 

Ser Ala 

Val Val 

Val Arg 

Thr Leu 
445 
Pro Pro 
4 60 

Gin Ser 

Met Arg 

Val Ser 

Tyr Thr 
525 
Ala His 
540 

Tyr Gin 
Phe Ala 
Asp Asp 



Asp Asn 

Ser Val 

Glu Thr 
635 



Phe Ala 
605 
Leu Lys 
620 

Ala Val 



190 

Met Ser Leu 

Ala Asp Trp 

Asn Asp Thr 
240 

Asp Val Pro 

255 
Arg Leu Thr 
270 

Val Asp Gin 

Ser Leu Pro 

Thr Phe Ala 
320 

Gly Leu Pro 

335 
Gin Ala Ser 
350 

Phe Pro Ala 

Glu Arg Tyr 

Gly Asp Thr 
400 

Thr Asn Asn 

415 
Val Pro Pro 
4 30 

Leu Glu Thr 

Ala Asp Phe 

Leu Leu Lys 
480 

Asn Pro Val 

495 
Phe Asn Asn 
510 

Gly He Arg 

Phe Arg Ser 

Gly Trp Glu 
560 

His Ala Gly 

575 
Leu Ala Thr 
590 

Ala Ala Val 

Ser Glu Ala 

Gly Ala Ala 
640 
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Gin Ser Gly Leu. Ala Lys Leu Pro Gly Leu Leu Met Ser Val Pro Gly 

645 650 655 

Lys He Ala Ala Arg Val Arg Ala Arg Arg Ala Arg Arg Arg Ala Ala 
660 665 670 

Arg Ala Asn 
675 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GGGGATCCAC AGTTCTGCCT CCCCCGGACG GTAAATATAG GGGAACCATG GTCTAGAGG 
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